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RECOMBINANT DNA ENCODING A DESULFURI Z ATION BIOCATALYST 

PftcyqpouND 

Sulfur contaminants in fossil fuels can create pro- 
blems in refinery processes which can be costly to recti- 
5 fy. The sulfur contaminants that occur in fossil fuels 
fall into either of the following general classes: miner- 
alized (inorganic, e.g., pyritic) sulfur and organic 
sulfur (sulfur that is covalently bound to carbonaceous 
molecules, referred to as organosulfur compounds). The 

10 presence of sulfur has been correlated with corrosion of 
pipeline, pumping and refining equipment, and with pre- 
mature breakdown of combustion engines. Sulfur also 
poisons many catalysts which are used in the refining of 
fossil fuels. Moreover, the atmospheric emission of 

15 sulfur combustion products, such as sulfur dioxide, leads 
to the form of acid deposition known as acid rain. Acid 
rain has lasting deleterious effects on aquatic and forest 
ecosystems, as well as on agricultural areas located 
downwind of combustion facilities. Monticello, D.J. and 

20 w.R. Finnerty, (1985) Ann. Rev. Microbiol. 39:37i-irq. 

Regulations such as the Clean Air Act of 1964 require the 
removal of sulfur, either pre- or post-combustion, from 
virtually all coal- and petroleum-based fuels. Conformity 
with such legislation has become increasingly problematic 

25 due to the rising need to utilize lower grade, higher- 
sulfur fossil fuels as clean-burning, low-sulfur petroleum 
reserves become depleted, as well as the progressive 
reductions in sulfur emissions required by regulatory 
authorities. Monticello, D.J. and J.J. Kilbane, "Practi- 

30 cal Considerations in Biodesulfurization of Petroleum 11 , 

I£Xi§ 3d Xptl. Symp. on Gas. Oil. Coal, and Env. Biotech. . 
(Dec. 3-5, 1990) New Orleans, LA. 

One technique which is currently employed for the 
pre-combustion removal of organic sulfur from liquid 
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fossil fuels, e.g., petroleum, is hydrodesulf urization 
(HDS). HDS is suitable for the desulfuri2ation of fossil 
fuels wherein organosulfur compounds account for a signi- 
ficant, e.g., a major, proportion of all sulfur contain- 
5 inants present. HDS is thus useful for treating crude oil 
or bitumen, petroleum distillate fractions or refining 
intermediates, liquid motor fuels, and the like. HDS is 
more particularly described in Shih, S.S. et aL , "Deep 
Desulfurization of Distillate Components", Abstract No. 

10 264B AIChE Chicago Annual Meeting, presented November 12, 
1990, (complete text available upon request from the 
American Institute of Chemical Engineers) ; Gary, J.H. and 
G.E. Handwerk, (197 5) Petroleum Refining: Technology and • 
Economics , Marcel Dekker, Inc., New York, pp. 114-120, and 

15 Speight, J.G., (1981) The Desulfurization of Heavy Oils 
and Residue . Marcel Dekker, Inc.,. New York, pp. 119-127. 
HDS is based on the reductive conversion of organic sulfur 
into hydrogen sulfide (H 2 S) in the presence of a metal 
catalyst. HDS is carried out under conditions of elevated 

20 temperature and pressure. The hydrogen sulfide produced 
as a result of HDS is a corrosive gaseous substance, which 
is stripped from the fossil fuel by known techniques. 
Elevated or persistent levels of hydrogen sulfide are 
known to poison (inactivate) the HDS catalyst, complicat- 

25 ing the desulfurization of liquid fossil fuels that are 
high in sulfur. 

Organic sulfur in both coal and petroleum fossil 
fuels is present in a myriad of compounds, some of which 
are termed labile in that they can readily be desulfur- 

3 0 ized, others of which are termed refractory in that they 
do not easily yield to conventional desulfurization treat- 
ment, e.g., by HDS. Shih, S.S. et al. Frequently, then, 
even HDS-treated fossil fuels must be post-combustively 
desulfurized using an apparatus such as a flue scrubber. 

35 Flue scrubbers are expensive to install and difficult to 
maintain, especially for small combustion facilities. 
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Moreover, of the sulfur-generated problems noted above, 
the use of flue scrubbers in conjunction with HDS is 
directed to addressing environmental acid deposition, 
rather than other sulfur-associated problems, such as 
5 corrosion of machinery and poisoning of catalysts. 

Recognizing these and other shortcomings of HDS, many 
investigators have pursued the development of microbial 
desulfurization (MDS) . MDS is generally described as the 
harnessing of metabolic processes of suitable bacteria to 
10 the desulfurization of fossil fuels. Thus, MDS typically 
involves mild (e.g., ambient or physiological) conditions, 
and does not involve the extremes of temperature and 
pressure required for HDS. It is also generally con- 
sidered advantageous that biological desulfurizing agents 
15 can renew or replenish themselves under suitable con- 
ditions. Microbial desulfurization technology is reviewed 
in Monticello and Finnerty (1985), 39 ANN. REV. MICROBIOL. 
371-389 and Bhadra et al. (1987), 5 BIOTECH. ADV. 1-27. 
Hartdegan et al. (1984), 5 CHEM. ENG. PROGRESS 63-67 and 
20 Kilbane (1989), 7 TRENDS BIOTECHNOL. (No. 4) 97-101 pro- 
vide additional commentary on developments in the field. 

Several investigators have reported mutagenizing 
naturally-occurring bacteria into mutant strains with the 
acquired capability of breaking down, i.e., catabolizing, 
25 dibenzothiophene (DBT) . Hartdegan, F.J. et al., (May 

1984) Chem. En^. Progress 63-67. DBT is representative of 
the class of organic sulfur molecules found in fossil 
fuels from which it is most difficult to remove sulfur by 
HDS. Most of the reported mutant microorganisms act upon 
30 DBT nonspecifically, by cleaving carbon-carbon bonds, 
thereby releasing sulfur in the form of small organic 
breakdown products. One consequence of this microbial 
action is that the fuel value of a fossil fuel so treated 
is degraded. Isbister and Doyle, however, reported the 
3 5 derivation of a mutant strain of Pseudomonas which ap- 
peared to be capable of selectively liberating sulfur from 
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DBT, thereby preserving the fuel value of treated fossil 
fuels. U.S. Patent No. 4,562,156. 

Kilbane recently reported the mutagenesis of a mixed 
bacterial culture, producing a bacterial consortium which 
5 appeared capable of selectively liberating sulfur from DBT 
by an oxidative pathway. Resour ■ Cons. Reeve 1 . 3:69-7 9 
(1990). A strain of Rhbdococcus rhodocrous was sub- 
sequently isolated from the consortium. This strain, 
which has been deposited with the American Type Culture 

10 Collection under the terms of the Budapest Treaty as ATCC 
No. 53968 and also referred to as IGTS8, is a source of 
biocatalytic activity as described herein. Microorganisms 
of the ATCC No. 53968 strain liberate sulfur from forms of* 
organic sulfur known to be present in fossil fuels, in- 

15 eluding DBT, by the selective, oxidative cleavage of 

carbon-sulfur bonds in organic sulfur molecules. Kilbane 
has described the isolation and characteristics of this 
strain in detail in U.S. Patent No. 5,104,801. 

SUMMARY OF THE INVENTION 

20 This invention relates in one aspect to a deoxyribo- 

nucleic acid (DNA) molecule containing one or more genes 
encoding one or more enzymes that, singly or in concert 
with each other, act as a biocatalyst that desulfurizes a 
fossil fuel that contains organic sulfur molecules. The 

25 DNA molecule of the present invention can be purified and 
isolated from a natural source, or can be a fragment or 
portion of a recombinant DNA molecule that is, e.g., 
integrated into the genome of a non-human host organism. 
The gene or genes of the present invention can be obtained 

30 from, e.g., a strain of Rhodococcus rhodochrous micro- 
organisms having suitable biocatalytic activity. That is, 
the gene or genes of the present invention can be obtained 
from a non-human organism, e.g., a microorganism, that 
expresses one or more enzymes that, singly or in concert 

35 with each other, act as a desulfurizing biocatalyst. 
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Biocatalysis, as described more fully below, is the selec- 
tive oxidative cleavage of carbon-sulfur bonds in organo- 
sulfur compounds. The present invention is particularly 
useful for the desulfurization of fossil fuels that con- 
5 tain organosulfur compounds, e.g., DBT. 

The invention further relates to recombinant DNA 
vectors, recombinant DNA plasmids and non-human organisms 
that contain foreign (recombinant, heterologous) DNA 
encoding a biocatalyst capable of desulfurizing a fossil 

10 fuel which contains organosulfur compounds. Such organ- 
isms are referred to herein as host organisms. 

The invention described herein thus encompasses 
ribonucleic acid (RNA) transcripts of the gene or genes of 
the present invention, as well as polypeptide expression 

15 product(s) of the gene or genes of the present -invention. 
The present polypeptide expression products, after such 
post-translational processing and/or folding as is neces- 
sary, and in conjunction with any coenzymes, cof actors or 
coreactants as are necessary, form one or more protein 

2 0 biocatalysts (enzymes) that, singly or in concert with 

each other, catalyze (promote, direct or facilitate) the 
removal of sulfur from organosulfur compounds that are 
found in fossil fuels. This is accomplished by the selec- 
tive, oxidative cleavage of carbon-sulfur bonds in said 
25 compounds. The biocatalyst of the present invention can 
be a non-human host organism, viable (e.g., cultured) or 
non-viable (e.g., heat-killed) containing the DNA of the 
present invention and expressing one or more enzymes 
encoded therein, or it can be a cell-free preparation 

3 0 derived from said organism and containing said one or more 

biocatalytic enzymes. 

In another aspect, the present invention relates to a 
method of desulfurizing a fossil fuel using the above 
mentioned non-human organism, said organism expressing a 
35 desulfurizing biocatalyst. Alternatively, the present 
invention relates to a method of desulfurizing a fossil 
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fuel using a biocatalyst preparation comprising one or 
more enzymes isolated from said organism. The process 
involves: 1) contacting said organism or biocatalyst 
preparation obtained therefrom with a fossil fuel that 
5 contains organic sulfur, such that a mixture is formed; 
and 2) incubating the mixture for a sufficient time for 
the biocatalyst expressed by or prepared from the organism 
to desulfurize the fossil fuel. The biocatalytically 
treated fossil fuel obtained following incubation has 
10 significantly reduced levels of organosulfur compounds, 

compared to a sample of the corresponding untreated fossil 
fuel . 

In yet another aspect, the invention relates to 
nucleic acid probes which hybridize to the recombinant DNA 

15 of the present invention. 

In still other aspects, the present invention relates 
to the production of new non-human organisms containing 
the recombinant DNA of the present invention and prefer- 
ably expressing the biocatalyst encoded therein. Avail- 

20 ability of the recombinant DNA of this invention greatly 
simplifies and facilitates the production and purification 
of biocatalysts for desulf urizing a fossil fuel. Costly 
and time consuming procedures involved in the purification 
of biocatalysts can be reduced, eliminating the need to 

25 generate the biocatalyst from, one or more non-human organ- 
isms in which it is naturally present or has been produced 
by mutagenesis. More specifically, non-human host organ- 
isms can be generated which express the gene or genes of 
the present invention at elevated levels. In addition, 

30 the invention described herein furthers the discovery of 
genes encoding desulfurization biocatalysts in additional 
non-human organisms. This objective can be accomplished 
using the nucleic acid probes of the present invention to 
screen DNA libraries prepared from one or more additional 

35 non-human organisms in whom biocatalytic function is known 
or suspected to be present. Any genes present in such 
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organisms and encoding desulfurization biocatalysts or 
components thereof can be replicated at large scale using 
known techniques, such as polymerase chain reaction (PCR) . 
PCR advantageously eliminates the need to grow the non- 
5 human organisms, e.g., in culture, in order to obtain 
large amounts of the DNA of interest. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a flow diagram schematic illusting a 
stepwise procedure for the isolation of the recombinant 
10 DNA of the present invention. 

Figure 2 is a schematic illustration of the Rhodo- 
coccus rhodochrous replication competent and chloram- 
phenicol resistant vector pRF29, said vector having been 
derived from Rhodococcus fascians . 
15 Figure 3 is a schematic illustration of the Rhodo- 

coccus rhodochrous replication competent and chloram- 
phenicol resistant vector pRR-6. 

Figure 4 is a schematic illustration of the restric- 
tion map for DNA plasmid pTOXI-1 encoding a biocatalyst 
20 capable of carbon-sulfur bond cleavage. 

Figure 5 is a schematic illustration of the restric- 
tion map for subclone pMELV-l, derived from plasmid pTOXI- 
1. 

Figure 6 is a schematic illustration of the restric- 
2 5 tion map for pMELV-l and fragments thereof present as 

inserts in subclones pSMELV-lA, pSMELV-2A, pSMELV-3A and 
PSMELV-4A. 

Figure 7 is a schematic illustration of the predicted 
locations within the sequence of pTOXI-l of three nearly 
30 contiguous open reading frames (ORFs; specifically, ORF 1, 
ORF 2 and ORF 3) encoding polypeptide expression products 
responsible for the Dsz+ phenotype. 

Figure 8 is a schematic illustration of the restric- 
tion map of pTOXI-l and fragments thereof present as 
35 inserts in subclones pENOK-1, pENOK-2, pENOK-3, pENOK-11, 
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pENOK-13, pENOK-16, pENOK-18, pENOK- Nsi, pENOK-19 AND 
pENOK-20. 

Figure 9 is a schematic illustration of the restric- 
tion map of pRR-12. 
5 Figure 10 is a schematic illustration of the restric- 

tion map of vector pKAMI. In the inset, the engineered 
cloning site present in pKAMI is shown in detail. 

Figure 11 is a schematic illustration of the restric- 
tion map of pSBG-2, in which expression of a promoterless 
10 Dsz gene cluster from pTOXI-1 is driven by the chloram- 
phenicol resistance promoter. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
In the petroleum extraction and refining arts, the 
term "organic sulfur" is generally understood as referring 

15 to organic molecules having a hydrocarbon framework to 
which one or more sulfur atoms (called heteroatoms) are 
covalently joined. These sulfur atoms can be joined 
directly to the hydrocarbon framework, e.g., by one or 
more carbon-sulfur bonds, or can be present in a sub- 

2 0 stituent joined to the hydrocarbon framework of the mole- 
cule, e.g., a sulfonyl group (which contains a carbon- 
oxygen-sulfur covalent linkage) . The general class of 
organic molecules having one or more sulfur heteroatoms 
are sometimes referred to as "organosulf ur compounds". 

25 The hydrocarbon portion of these compounds can be ali- 
phatic, aromatic, or partially aliphatic and partially 
aromatic. 

Cyclic or condensed multicyclic organosulfur com- 
pounds in which one or more sulfur heteroatoms are linked 

30 to adjacent carbon atoms in the hydrocarbon framework by 
aromatic carbon-sulfur bonds are referred to as "sulfur- 
bearing heterocycles". The sulfur that is present in many 
types of sulfur-bearing heterocycles is referred to as 
"thiophenic sulfur" in view of the f ive-membered aromatic 

35 ring in which the sulfur heteroatom is present. The 
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simplest such sulfur-bearing heterocycle is thiophene, 
which has the composition C 4 H 4 S. 

Sulfur-bearing heterocycles are known to be stable to 
conventional desulfurization treatments, such as HDS. For 
5 this reason, they are said to be refractory or recalci- 
trant to HDS treatment. Sulfur-bearing heterocycles can 
have relatively simple or relatively complex chemical 
structures. In complex heterocycles, multiple condensed 
aromatic rings, one or more of which can be heterocyclic, 

10 are present. The difficulty of desulfurization increases 
with the structural complexity of the molecule, Shih et 
al . That is, refractory behavior is most accentuated in 
complex sulfur-bearing heterocycles, such as dibenzo- 
thiophene (DBT, C 12 H 8 S) . 

15 DBT is a sulfur-bearing heterocycle that has a con- 

densed, multiple aromatic ring structure in which a five- 
membered thiophenic ring is flanked by two six-membered 
benzylic rings. Much of the residual post-HDS organic 
sulfur in fossil fuel refining intermediates and com- 

2 0 bustible products is thiophenic sulfur. The majority of 
this residual thiophenic sulfur is present in DBT and 
derivatives thereof having one or more alkyl or aryl 
radicals attached to one or more carbon atoms present in 
one or both flanking benzylic rings. Such DBT derivatives 

25 are said to be "decorated" with these radicals. DBT 

itself is accepted in the relevant arts as a model com- 
pound illustrative of the behavior of the class of com- 
pounds encompassing DBT and alkyl- and/or aryl-decorated 
derivatives thereof in reactions involving thiophenic 

30 sulfur. Monticello and Finnerty (1985), Microbial desul- 
furization of fossil fuels, 39 ANNUAL REVIEWS IN MICRO- 
BIOLOGY 371-389, at 372-373. DBT and radical-decorated 
derivatives thereof can account for a significant per- 
centage of the total sulfur content of particular crude 

35 oils, coals and bitumen. For example, these sulfur-bear- 
ing heterocycles have been reported to account for as much 



WO 94/01563 



PCT/US93/06497 



-10- 

as 70 wt% of the total sulfur content of West Texas crude 
oil, and up to 40 wt% of the total sulfur content of some 
Middle East crude oils. Thus, DBT is considered to be 
particularly relevant as a model compound for the forms of 
5 thiophenic sulfur found in fossil fuels , such as crude 
oils , coals or bitumen of particular geographic origin, 
and various refining intermediates and fuel products 
manufactured therefrom. Id. Another characteristic of 
DBT and radical-decorated derivatives thereof is that, 

10 following a release of fossil fuel into the environment, 
these sulfur-bearing heterocycles persist for long periods 
of time without significant biodegradation. Gundlach et 
Alx (1983), 221 SCIENCE 122-129. Thus, most prevalent 
naturally occuring microorganisms do not effectively 

15 metabolize and break down sulfur-bearing heterocycles. 

A fossil fuel that is suitable for- desulfurization 
treatment according to the present invention is one that 
contains organic sulfur. Such a fossil fuel is referred 
to as a "substrate fossil fuel". Substrate fossil fuels 

20 that are rich in thiophenic sulfur (wherein a significant 
fraction of the total organic sulfur is thiophenic sulfur, 
present in sulfur-bearing heterocycles, e.g., DBT) are 
particularly suitable for desulfurization according to the 
method described herein. Examples of such substrate 

25 fossil fuels include Cerro Negro or Orinoco heavy crude 

oils; Athabascan tar and other types of bitumen; petroleum 
refining fractions such as light cycle oil, heavy atmos- 
pheric gas oil, and No. 1 diesel oil; and coal-derived 
liquids manufactured from sources such as Pocahontas #3, 

30 Lewis-Stock, Australian Glencoe or Wyodak coal. 

Biocatalytic desulfurization (biocatalysis or BDS) is 
the excision (liberation or removal) of sulfur from or- 
ganosulfur compounds, including refractory organosulfur 
compounds such as sulfur-bearing heterocycles, as a result 

35 of the selective, oxidative cleavage of carbon-sulfur 

bonds in said compounds by a biocatalyst. BDS treatment 



# 
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yields the desulfurized combustible hydrocarbon framework 
of the former refractory organosulfur compound, along with 
inorganic sulfur — substances which can be readily sepa- 
rated from each other by known techniques such as fra- 
tional distillation or water extraction. For example, DBT 
is converted into hydroxybiphenyl or dihydroxybiphenyl , or 
a mixture thereof, when subjected to BDS treatment. BDS 
is carried out by a biocatalyst comprising one or more 
non-human organisms (e.g., microorganisms) that . function- 
ally express one or more enzymes that direct, singly or in 
concert with each other, the removal of sulfur from or- 
ganosulfur compounds, including sulfur-bearing hetero- 
cycles, by the selective cleavage of carbon-sulfur bonds 
in said compounds; one or more enzymes obtained from such 
15 microorganisms; or a mixture of such microorganisms and 

enzymes. Organisms that exhibit biocatalytic activity are 
referred to herein as being CS+ or Dsz+. Organisms that 
lack biocatalytic activity are referred to herein as being 
CS- or Dsz-. 

20 As summarized above, the invention described herein 

relates in one aspect to a DNA molecule or fragment there- 
of containing a gene or genes which encode a biocatalyst 
capable of desulfurizing a fossil fuel that contains 
organosulfur compounds. The present DNA molecule or 
25 fragment thereof can be purified and isolated DNA obtained 
from, e.g., a natural source, or can be recombinant (het- 
erologous or foreign) DNA that is, e.g., present in a non- 
human host organism. The following discussion, which is 
not to be construed as limiting on the invention in any 
way but is presented for purposes of illustration, re- 
counts the isolation of DNA encoding a desulfurizing 
biocatalyst from a strain of Rhodococcus rhodochrous . ATCC 
No. 53 968, that is known to express suitable biocatalytic 
activity. This preferred strain of Rhodococcus rhodnnrniK 
35 is disclosed in U.S. Patent No. 5,104,801 (issued 1992), 
the teachings of which are incorporated herein by refer- 



30 
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ence, and has been referred to in the literature as IGTS8. 
IGTS8 was developed by investigators at the Institute of 
Gas Technology in Chicago IL. Other organisms that are 
known to express suitable biocatalytic activity and thus 
5 are viewed as suitable sources of the DNA of the present 
invention include the strain of Bacillus sulf asportare 
described in U.S. Patent 5, 002,888 and deposited with the 
American Type Culture Collection as ATCC No. 53969 and the 
Corvnebacterium strain described in Omori et al . (1992) , 

10 Desulfurization of dibenzothiophene bv Corvnebacterium sp. 
strain 5Y1. 58 APPL. ENV. MICROBIOL. (No. 3) 911-915. The 
isolation of the DNA of the present invention from the 
ATCC No. 53968 microorganism is schematically depicted in 
Figure 1, and will now be described. 

15 Mutant strains of R. rhodochrous that are incapable 

of cleaving carbon-sulfur bonds (CS- or Dsz-) , are pro- 
duced by exposing a strain of R. rhodochrous . e.g./ ATCC 
No. 53968, that exhibits biocatalytic activity (that is 
CS+ or Dsz+) , to a mutagen under appropriate conditions 

2 0 that are known to or readily ascertainable by those 

skilled in the art. Suitable mutagens include radiation, 
e.g., ultraviolet radiation, and chemical mutagens, e.g., 
N-methyl-N' -nitrosoguanidine (NTG) , hydroxylamine, ethyl- 
methanesulphonate (EMS) and nitrous acid. Mutants thus 
25 formed are allowed to grow in an appropriate medium and 
screened for carbon-sulfur bond cleavage activity. Mu- 
tants identified as lacking carbon-sulfur bond cleavage 
activity are termed CS- . Any method of screening which 
allows for an accurate detection of carbon-sulfur bond 

3 0 cleavage activity is suitable in the method of the present 

invention. Suitable methods of screening for this activ- 
ity include exposing the different mutants to carbon- 
sulfur bond containing molecules (e.g., DBT) and measuring 
carbon-sulfur bond cleavage. In a preferred embodiment, 
3 5 the mutants are exposed to DBT, such that the breakdown 
product, hydroxybiphenyl (HBP) , which fluoresces under 
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short wave ultraviolet light, is produced. HBP can also 
be detected colorimetrically through its blue reaction 
product with Gibbs' reagent. Other methods include gas 
and liquid chromatography, infrared and nuclear magnetic 
5 resonance spectrometry. See Kodama, et al, , Applied and 
Environmental Microbiology, pages 911-915 (1992) and 
Kilbane and Bielaga, Final Report D.O.E. Contract No. DE- 
AC22-88PC8891 (1991). Once CS- mutants are identified and 
isolated, clones thereof are propagated using standard 

10 techniques and subjected to further analysis. 

Concurrent with the mutagenesis of the above-des- 
cribed culture of the CS+ organism, R. rhodochrous . a 
second culture of the same CS+ organism (1) is maintained 
in culture. CS+ organism DNA (3) is extracted from this 

15 culture of R, rhodocrous . Various methods of DNA extrac- 
tion are suitable for isolating the DNA of this organism. 
Suitable methods include phenol and chloroform extraction. 
See Maniatis et al . , Molecular Cloning. A Laboratory 
^anuaj,, 2d, Cold Spring Harbor Laboratory Press, page 

20 16.54 (1989), herein referred to as Maniatis et' al . . 

Once the -DNA is extracted from R. rhodochrous l, the 
DNA (3) is cut into fragments of various kilobase lengths, 
which collectively make up DNA library 5. Various methods 
of fragmenting the DNA of R. rhodochrous to purify DNA 

25 therefrom, including the DNA of the present invention, can 
be used, e.g. ,. enzymatic and mechanical methods. Any 
four-base recognition restriction endonuclease such as 
TaqI or Sau 3 A is suitable for fragmenting the DNA. 
Suitable methods of fragmenting DNA can be found in Mani- 

30 atis et al . . 

The various DNA fragments are inserted into several 
CS- mutant clones of R. rhodochrous ( 2) , with the purpose 
of isolating the fragment of DNA that encodes the bio- 
catalyst of the present invention. The transformation of 

35 a previously CS- mutant cell to a CS+ transformed cell is 
evidence that the inserted DNA fragment encodes a bio- 
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catalyst. Any method of inserting DNA into R. rhodochrous 
which allows for the uptake and expression of said frag- 
ment is suitable. In a preferred embodiment, electro- 
poration is used to introduce the DNA fragment into £^ 
5 rhodochrous . See Maniatis et al . . 

Once transformed, CS+ mutant R* rhodochrous 7 has 
been produced and identified, DNA fragment 9 encoding the 
CS+ biocatalyst can be identified and isolated. The 
encoded biocatalyst can then be produced using the iso- 

10 lated DNA in various methods that are well known and 
readily available to those skilled in the art. In ad- 
dition, the isolated DNA can be sequenced and replicated 
by known techniques, e.g., the techniques described in 
Maniatis et al . . 

15 As noted previously, the above-described method for 

isolating the DNA of the present invention can be applied 
to CS+ organisms other than R. rhodocrous microorganisms, 
e.g., of the strain ATCC No. 53968. Thus, Bacillus sulfa- 
sportare ATCC No. 53969 or Corvnebacterium sp. SY1 can be 

20 used as the source organism for the DNA of the present 
invention. Furthermore, once isolated, the DNA of the 
present invention can be transfected into a non-human host 
organism other than a CS- mutant of the source organism. 
Thus, the DNA of the present invention can be transfected 

25 into, e.g., a suitable strain of Escherichia coli bacte- 
ria. Other types of non-human host organism can also be 
used, including unicellular organisms (e.g., yeast) and 
cells established in culture from multicellular organisms. 
Other methods of isolating the DNA of the present 

30 invention, include variations on the rationale described 
above and depicted in Figure 1. For example, it would be 
possible to randomly insert a CS- DNA plasmid into clones 
of a CS+ strain of R . rhodochrous . DNA encoding a CS+ 
biocatalyst could then be identified by screening for 

35 clones that have been transformed from CS+ to CS-. 
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The recombinant DNA molecule or fragment thereof of 
the present invention is intended to encompass any DNA 
resulting from the insertion into its chain, by chemical 
or biological means, of one or more genes encoding a 
5 biocatalyst capable of selectively cleaving carbon-sulfur 
bonds, said gene not originally present in that chain. 
. Recombinant DNA includes any DNA created by procedures 
using restriction nucleases, nucleic acid hybridization, 
DNA cloning, DNA sequencing or any combination of the 
10 preceding. Methods of construction can be found in Man- 
iatis et al . . and in other methods known by those skilled 
in the art. 

Procedures for the construction of the DNA plasmids 
or vectors of the present invention include those des- 

15 cribed in Maniatis et al. and other methods known by those 
skilled in the art. Suitable plasmid vectors include pRF- 
29 and pRR-6 depicted in Figures 2 and 3, respectively. 
The terms "DNA plasmid" and "vector" are intended to 
encompass any replication competent plasmid or vector 

20 capable of having foreign or exogenous DNA inserted into 
it by chemical or biological means and subsequently, when 
transfected into an appropriate non-human host organism, 
of expressing the product of the foreign or exogenous DNA 
insert (i.e., of expressing the biocatalyst of the present 

25 invention) . In addition, the plasmid or vector must be 
receptive to the insertion of a DNA molecule or fragment 
therepf containing the gene or genes of the present inven- 
tion, said gene or genes encoding a biocatalyst that 
selectively cleaves carbon-sulfur bonds in organosulfur 

3 0 compounds. Procedures for the construction of DNA plasmid 
vectors include those described in Maniatis et al. and 
others known by those skilled in the art. 

The plasmids of the present invention include any DNA 
fragment containing a gene or genes encoding a biocatalyst 

35 that selectively cleaves carbon-sulfur bonds in organo- 
sulfur compounds. The term "plasmid" is intended to 
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encompass any DNA fragment. The DNA fragment should be 
transmittable to a host microorganism by transformation or 
conjugation. Procedures for the construction or extrac- 
tion of DNA plasmids include those described in Maniatis 
5 et al. and others known by those skilled in the art. 

The transformed non-human host organisms of the 
present invention can be created by various methods by 
those skilled in the art. For example, transfection 
electroporation as explained by Maniatis et al. can be 

10 used. By the term "non-human host organism" is intended 
any non-human organism capable of the uptake and expres- 
sion of- foreign, exogenous or recombinant DNA, i.e., DNA 
not originally a part of the organism's nuclear material. 
The method of desulf urizing a fossil fuel of the 

15 present invention involves two aspects. First, a host 

organism or biocatalytic preparation obtained therefrom is 
contacted with a fossil fuel to be desulf urized. This^ can 
be done in any appropriate container, optionally fitted 
with an agitation or mixing device. The mixture is com- 

2 0 bined thoroughly and allowed to incubate for a sufficient 

time to allow for cleavage of a significant number of 
carbon-sulfur bonds in organosulfur compounds, thereby 
producing a desulf urized fossil fuel. In one embodiment, 
an aqueous emulsion is produced with an aqueous culture of 
25 the organism and the fossil fuel, allowing the organism to 
propagate in the emulsion while the expressed biocatalyst 
cleaves carbon-sulfur bonds. 

Variables such as temperature, mixing rate and rate 
of desulfurization will vary according to the organism 

3 0 used. The parameters can be determined through no more 

than routine experimentation. 

Several suitable techniques for monitoring the rate 
and extent of desulfurization are well-known and readily 
available to those skilled in the art. Baseline and . 
3 5 timecourse samples can be collected from the incubation 
mixture, and prepared for a determination of the residual 
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organic sulfur in the fossil fuel. The disappearance of 
sulfur from organosulfur compounds, such as DBT, in the 
sample being subjected to biocatalytic treatment can be 
monitored using, e.g., X-ray fluorescence (XRF) or atomic 
5 emission spectrometry (flame spectrometry). Preferably, 
the molecular components of the sample are first sepa- 
rated, e.g., by gas chromatography. 

The nucleic acid probes of the present invention 
include any nuclear material capable of hybridizing to at 

10 least a portion of the DNA of the present invention. The 
term "nucleic acid probe" includes any nuclear material 
capable of hybridizing to DNA. 

The invention will now be further illustrated by the 
following specific Examples, which are not to be viewed as 

15 limiting in any way. 

EXAMPLE 1. ISOLATION OF DNA ENCODING A DESULFURI Z ATION 
ACTIVE BIOCATALYST. 

As used herein, the term "Dsz+" refers to the ability 

of an organism to utilize thiophenic compounds such as 

20 dibenzothiophene (DBT) as the sole source of sulfur by the 
selective cleavage of carbon-sulfur bonds therein. Rhodo- 
coccus rhodochrpus strain IGTS8 demonstrates the Dsz* 
phenotype. The term «Dsz-» referrs to an organism's 
inability to utilize said thiophenic compounds as a sole 

25 source of sulfur by the selective cleavage of carbon- 
sulfur bonds therein. 

Materials 

Bacterial strains and plasmidg 

Rhodococcus rho dochrous strain I GTS 8 (ATCC No. 
30 53968), obtained from the Institute of Gas Technology 

(Chicago, IL) , was used as a parent strain for production 
of mutant strains which have lost the desulfurization 
phenotype (Dsz-) . Strain IGTS8 was also used for iso- 
lation of DNA fragments capable of complementing said 
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mutants toproduce Dsz+ mutants therefrom. Rhodococcus 
vector pRF-2 9 was obtained from the Institute of Gas 
Technology, The construction of pRF-29 is described in 
Desomer, et al. (1990), Transformation of Rhodococcus 
fascians bv Hioh-Voltage Electropo ration and Development 
of R. fascians Cloning Vectors . APPLIED AND ENVIRONMENTAL 
MICROBIOLOGY 2818-2825* The structure of pRF-29 is sche- 
matically depicted in Figure 2. 

Escherichia coli strain JM109 was used as a host in 
transformation with plasmid constructs derived from the 
plasmids pUC18 and pUC19 (Bethesda Research Laboratories, 
Bethesda, MD) . 

Enzymes and Reagents 

Restriction endonucleases were purchased from 
Bethesda Research Laboratories (BRL) and New England 
Biolabs (Beverly, MA) . T4 ligase and the Klenow fragment 
of E. coli DNA polymerase I were purchased from BRL, HK™ 
Phosphatase was purchased from Epicentre Technologies 
(Madison, WI) . All enzymes were used in accordance with 
manufacturers recommendations. Enzyme assay substrates 
Dibenzothiophene (DBT) , Dibenzothiophene 5-oxide (DBT 
sulfoxide) and Dibenzothiphene sulfone (DBT sulfone) were 
purchased- from Aldrich (Milwaukee, WI) . Gibb's Reagent, 
2 , 6-dicholoroquinone-4-chloroimide , was purchased from 
Sigma (St. Louis, MO). Chemical mutagen N-methyl-N 1 - 
nitro-N-nitrosoguanidine (NTG) was also purchased from 
Sigma . 

Growth Media and Conditions 

E. coli JM109 was grown in L-broth (Difco, Detroit, 
MI) . Transf ormants were selected on L-plates supplemented 
with 1.5% agar and containing 125|ig/ml ampicillin. E. 
coli strains were grown at 37 °C. Rhodococcus strains were 
maintained on Rhodococcus Media (RM) composed per liter 
of: 8.0g Nutrient Broth (Difco), 0.5g yeast extract, 10. Og 
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glucose, Transf onnants of Rhodococcus strains were se- 
lected on RM plates supplemented with 1.5% agar and con- 
taining 25ng/rol chloramphenicol. For expression of the 
Dsz+ phenotype, Rhodococcus strains were grown in Basal 
5 Salts Media (BSM) composed per liter of: 2.44g KH 2 P0 4 , 
5.57g Na 2 HP0 4 2 . 0g NH 4 C1, 0.2 g MgCl 2 .6H 2 0, O.OOlg 
CaCl 2 .2H 2 0, O.OOlg FeCl 3 6H 2 0, 0.004g MnCl 2 .4H 2 0, 6.4ml 
glycerol. Optionally, BSM can be supplemented with 
5.0g/liter glucose. Rhodococcus strains were grown at 
10 30*C. 

Methods 

Sulfur Bioavailability Assay 

The sulfur bioavailability assay, described in U.S. 
Patent 5,104,801, examines an organism's ability to liber- 

15 ate organically bound sulfur from substrates (e.g., DBT, 
DBT sulfoxide, DBT sulfone) for use as the sole source of 
sulfur for growth. In the assay, BSM, which contains no 
sulfur, is supplemented with one or more sulfur containing 
substrates, e.g., DBT. The organism's ability to liberate 

20 sulfur therefrom is measured by its ability to grow with 
proper incubation, as monitored by optical density at 600 
ran, 

Gibbs Assay for 2-Hydroxvbiphenyl 

The oxidative product of DBT, DBT sulfoxide and DBT 

25 sulfone incubated with strain IGTS8 is 2-hydroxybiphenyl 
(2-HBP) . The Gibbs assay colorimetrically quantitates the 
amount of 2-HBP produced from DBT and its above-mentioned 
oxidative derivatives. The assay measures 2-HBP produced 
in culture supernatants after incubation with DBT. The 

3 0 media must be adjusted to pH 8.0 before the Gibb's reagent 
is added. Gibb's Reagent, 2,6- dicholoroguinone^4-chloro- 
imide (lOmg/ml in ethanol) , is added to culture super- 
natants at 1:100 (v/v) . Color development is measured as 
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absorbance at 610nm after a 30 minute incubation at room 
temperature. 

HPLC Assay for 2-HvdroxvbiphenYl 

2-HBP production cultures incubated with DBT can also 
5 be detected by HPLC using instrumentation available from 
Waters, Millipore Corporation, Milford, MA. Reagent 
alcohol is added to culture broth at 1:1 (v/v) in order to 
solubilize all remaining DBT and 2-HBP. Samples are 
agitated for 5 min at 220 rpm. Extracted broth samples 
10 are removed and centrifuged to remove cellular mass. 

Clarified supernatants are then analyzed by HPLC with the 
following conditions: 

Column: Waters 4n Phenyl Novapak 

Detection 

15 Parameters: DBT 23 3nm, 1.0 AUFS 

2-HBP 248nm, 0.2 AUFS 

Quantitative 

Detection Limits: DBT 10 - 250 \xK 

2-HBP 6 - 60 nM 

20 Mobile Phase: Isocratic 70% Acetonitrile 

1.5ml/min 

Retention times: DBT 4.5 minutes 

2-HBP 2.9 minutes 



IGTS8 Mutagenesis 

25 In order to generate mutant strains of R. rhodochrous 

which did not metabolize DBT (Dsz- mutants), biocatalyst 
source strain IGTS8 (Dsz+) was subjected to mutagenesis by 
short-wave UV light" and to chemical mutagenesis with N- 
methyl-N r -nitro-N-nitrosoguanidine (NTG) . With UV ex- 

30 posure mutagenesis, a kill rate of greater than 99% was 

targeted. Continuously stirred R. rhodochrous cells at an 
optical density (A^) of 0.3 were subjected to UV exposure 
from a Mineralight Lamp Model UVG-254 (Ultra-violet Pro- 
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ducts, Inc., San Gabriel, CA) at a distance of 10 cm for 
55 to 65 seconds to obtain this kill rate (97.9-99.9%). 
For NTG mutagenesis, cell suspensions were treated with 
500 ng/ml NTG for a duration determined to achieve a kill 
5 rate of 30%-50%. Combination mutagenesis utilizing both 
NTG and UV was also done. For these an overall kill rate 
of greater than 99.9% was used. Colonies surviving muta- 
genesis were picked onto RM plates and screened for the 
Dsz- phenotype as described below. 

Screening Example A: Initially, a DBT-spray plate screen 
was used to select Dsz- mutants. Mutant colonies were 
replica plated onto Basal Salts Media (BSM) electro- 
phoretic-grade agarose plates which contained no added 
sulfur. Colonies were allowed to grow at 30 'C for 24hr. 
The plates were then sprayed with an even coating of 10% 
DBT dissolved in ether and incubated at 30 °C for 90 min- 
utes. The plates were then wiped clean and observed under 
short-wave UV light. The observed end product of DBT 
metabolism, 2-hydroxybiphenyl (2-HBP) fluoresces under 
short-wave UV light. Colonies that produce 2-HBP are thus 
identified by fluorescent spots on the agarose. Colonies 
that do not produce 2-HBP (that are Dsz-). do not produce 
fluorescent spots. 

Screening Example B: A simpler variation of screening 
25 involved replica plating surviving mutagenized colonies to 
BSM ag-arose plates supplemented with 1.2ml/liter of a 
saturated ethanol solution of DBT. After 24 hours, pro- 
duction of 2-HBP can be visualized under UV illumination 
as above. 



Mutants which did not appear to produce 2-HBP by the 
above-described screening methods were examined with the 
sulfur bioavailability assay, with DBT as the sole source 
of sulfur. Growth of potential mutants was examined in 
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1.25ml liquid fermentations in BSM plus DBT media dis- 
pensed in 24-well plates (Falcon) . i After a one day incu- 
bation at 30 *C, 2-HBP production was monitored by the 
Gibbs colorimetric assay. Strains which continue to 
5 demonstrate the Dsz- phenotype were incubated in larger 
volumes of BSM plus DBT and analyzed for 2-HBP or inter- 
mediates by the HPLC method. Because BSM is a defined 
minimal medium, a duplicate control culture which con- 
tained supplemental inorganic sulfur was grown in order to 

10 distinguish true Dsz-- mutants from auxotrophic mutants. 
Mutants which failed to grow in both the control and 
experimental media were assumed to be auxotrophic mutants. 

Of 1970 individually analyzed potential mutants, two 
were identified as Dsz-. One mutant, GPE-362, was gener- 

15 ated by NTG mutagenesis. The other, CPE-648, was gene- 
rated by combination NTG/UV mutagenesis. Both GPE-3 62 and 
CPE-648 grow slowly in sulfur bioavailability assays, 
presumably from trace amounts of sulfur on the glassware 
or in the media components. However, no* detectable 

20 amounts of 2-HBP were produced by either mutant after an 
extended incubation of 6 to 10 days with DBT, as assessed 
with either the Gibbs assay or the HPLC assay. Thus, 
independently produced R. rhodocrous IGTS8. mutants GPE-3 62 
and CPE-648 were Dsz- organisms. 

25 Vector Construction 

Vector constructs were derived from R. rhodochrous 
and confer chloramphenicol resistance. All constructs 
were developed in E. coli strain JM109. Transformation of 
JM109 was carried out with the Gene Pulsar (Bio-Rad Labo- 

30 ratories, Richmond, CA) according to manufacturer's recom- 
mendations. Plasmid isolation from JM109 was performed by 
standard methods (Birnboim and Doly (1979), A rapid alka- 
line ex traction procedure for screening recombinant plas- 
mid DNA . 7 NUCLEIC ACIDS RES. 1513-1523; Maniatis et al. 

35 (1982), MOLECULAR CLONING: A LABORATORY MANUAL (Cold 
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Spring Harbor Laboratory Press) . Transf ormants containing 
correct vector constructs were identified by restriction 
analysis. 

Vector Construct A: pRR-6 (Figure 3) contains the Rhodo- 
5 cpccus origin of replication and Chloramphenicol resis- 
tance marker (Cm R ) . The ori and Cm" have been removed 
from pRF-29 as a 6.9kb Xhol/Xba (partial) fragment. The 
ends were made blunt with Klenow and ligated to Sall/Xbal 
cut pKF39. pKF39 is pUC18 with the Small site replaced 
10 with a Bglll site. A unique Narl site is available for 
cloning in pRR-6. Narl ends are compatible with 4-base 
recognition endonuclease Taql. 

Transfor mation of Rhodococcus rhodochrous 

Transformation of IGTS8 and Dsz- mutants thereof can 

15 be achieved by electroporation. The following conditions 
were used in all transformations of Rhodococcus rhodn- 
chrous . Cells were grown in RM to mid-log phase and 
harvested by centrif ugation (5000xg) , then washed three 
times in cold, deionized, distilled water and concentrated 

20 50-fold in 10% glycerol. The resulting cell concentrate 
could be used for electroporation directly or stored at - 
80'C. 

Electroporations were carried out with the Gene 
Pulser (Bio-Rad) apparatus. 100 \il cells were mixed with 

25 transformation DNA in a 2-mm gapped electrocuvette (Bio- 
Rad) and subjected to a 2.5 kv pulse via the pulse con- 
troller (25 nF capacitor, 200 Q external resistance) . 
Pulsed cells were mixed with 400 \i RM and incubated for 4 
hours at 30 *C with regular agitation. Cells were then 

3 0 plated to RM supplemented with proper antibiotic. 

When I GTS 8 was transformed with pRF-29, chloram- 
phenicol resistant colonies were cleanly selected at a 
frequency of 10 s - 10 6 /ng DNA on plates containing 25 
pg/ml chloramphenicol. 
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Small Scale Plasmid Preparation from R. rhodochrous 

A single colony of Rhodococcus rhodochrous was used 
to inoculate 2 to 7ml of RM plus 25 ng/ml chloramphenicol. 
The culture was incubated for two days at 30 °C with shak- 
5 ing. Cells were pelleted by centrifugation and resus- 

perided in 300 \il sucrose buffer (20% sucrose, 0.05 M Tris- 
CI pH 8.0, 0.01 M EDTA 0.05 M NaCl , 10 ing/ml lysozyme) and 
incubated at 37'C for 1 hour. 300 \il Potassium acetate- 
acetate solution, pH 4.8 (60 ml 5 H KOAc, 11.5 ml Glacial 

10 acetic acid, 28.5 ml dLH 2 0} , was added and the mixture was 
gently mixed by inversion. The mixture was placed on ice 
for 5 minutes and then cellular debris was pelleted by 
centrifugation. 500 \xl supernatant was removed to a fresh 
tube to which RNAse was added to 0.05 pg/*il and incubated 

15 for 2 0 minutes at 37 *C. The sample was then phenol :- 
chloroform extracted and the aqueous layer was precip- 
itated at -80*0 with an equal volume of isopropanol. DNA 
was pelleted by centrifugation and resuspended in 0.3 M 
NaOAc pH 8.0. DNA was precipitated again at -80 °C with an 

20 equal volume of isopropanol. DNA was pelleted by centri- 
fugation and resuspended in 0.3 M NaOAc pH 8.0. DNA was 
precipitated again at -80 °C with two volumes of 95% EtOH. 
Pelleted DNA was washed with 70% EtOH and resuspended in 
50 nl TE (Tris EDTA) . 

25 Isolation of Genomic DNA from R. rhodochrous Strain IGTS8 
IGTS8 genomic DNA was isolated as described. 2 0 ml 
RM was inoculated with a single colony of IGTS8 and incu- 
bated at 30 °C for 48 hours with shaking at 220 rpm. Cells 
were harvested by centrifugation (5000xg) . Cells were 

3 0 resuspended in 10ml TE (10 Tris Base, 1 jnU EDTA) with 
100 mg lysozyme and incubated for 30 minutes at 30 °C. 
Cells were lysed by adding 1 ml of 20% sodium dodecyl 
sulfate (SDS) . 10 ml of TE-saturated phenol and 1.5 ml 5 
M NaCI were added immediately and the mixture was gently 

35 agitated for 20 minutes at room temperature. Phenol was 
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removed by centrifugation, and the aqueous layer was 
extracted twice with an equal volume of chloroform. An 
equal volume of isopropanol was added to the aqueous layer 
to precipitate the DNA. DNA was spooled onto a pasteur 
5 pipette and redissolved in TE. DNA was then RNased with 
20 \xg/ml RNA for 1 hour at 37 *C. The sample was made to a 
final concentration of 100 jnM NaCl and 0.4% SDS and pro- 
teased with 100 pg/ml protease K. The sample was then 
extracted with phenol and chloroform and precipitated with 
10 isopropanol as before. The purified genomic DNA, which 

included the DNA of the present invention, was resuspended 
in TE. 

Construction of Plasmid Library of IGTS8 

Genomic DNA from the Dsz+ source organism (IGTS8) was 

15 cut with TaqI in order to produce fragments 0.5. - 23 kb in 
length. Cut DNA was electrophoresed through 0.8% low 
melting temperature agarose and DNA fragments greater than 
5 kb in length were isolated and purified by standard 
methods (Maniatis, T. et al . (1982), MOLECULAR CLONING: A 

20 LABORATORY MANUAL (Cold Spring Harbor Laboratory Press)). 
Vector pRR-6 was cut with Narl to completion. The vector 
ends were dephosphorylated with HK™ phosphatase to pre- 
vent religation of the vector. The size-f ractioning 
genomic DNA was ligated to cut and dephosphorylated pRR-6. 

25 Molecular Complementation of Dsz- Mutant Strain cpp-^a 

Plasmid library ligations (above) were used to trans- 
form Dsz- mutant strain CPE-648 by electroporation as 
described. Negative control transformations of CPE-648, 
which did not contain DNA (mock transformations), were 

30 also performed. After the four hour incubation in rm, the 
cells were spun out of suspension by centrifugation and 
the supernatant was removed. The cells were resuspended 
in BSM with no sulfur. These cells were used to inoculate 
250 ml of BSM supplemented with 300 \il of a saturated 
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ethanol solution of DBT. By this procedure, clones which 
are capable of complementing the Dsz- mutation will be 
selected by the sulfur bioavailability assay. Strains 
containing the complementing sequences (i.e., the DNA of 
5 the present invention) will successfully remove the sulfur 
from DBT and grow preferentially. 

After 6 days incubation at 30*C, the cultures were 
assayed for 2-HBP by HPLC. Accumulation of 2-HBP was 
detected in experimental cultures while no accumulation of 

10 2-HBP was detected in control cultures. The culture 
producing 2-HBP was spread onto RM plates supplemented 
with chloramphenicol to obtain single colonies that were 
harboring plasmids. These plates were replica-plated to 
BSM agarose plates supplemented with 1.2 ml/liter of a 

15 saturated ethanol solution of DBT. After 24 hours in- 
cubation at 30 *C, 2-HBP could be detected around some 
individual colonies under short wave UV illumination. 
These colonies presumably harbored plasmids which com- 
plemented the Dsz- mutant by restoring the former Dsz+ 

2 0 phenotype. 

Characterization of Clones Complementing Dsz- Mutant 
CPE-648 

Two independent plasmid libraries successfully com- 
plemented mutant CPE-648 to Dsz* as described above. 

25 Plasmid DNA was isolated from single colonies which demon- 
strated 2-HBP production on BSM plus DBT plates (above) 
from cultures transformed with each of the two libraries. 
This plasmid DNA was used to transform E. coli strain 
JM109. Plasmid DNA was isolated and cut with restriction 

30 endonucleases in order to build a restriction map of the 
clones. Each of the two libraries yielded a single com- 
plementing .clone. By restriction pattern similarities, 
the two clones appear to have overlapping sequences. 
These clones have been designated pTOXI-1 (Figure 4) and 

35 pTOXI-2, respectively. pTOXI-1 contains an insert of 
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approximately 6.6kb. pTOXI-2 contains an insert of ap- 
proximately 16.8kb. 

Complementation of Dsz' Mutant GPE-362 

Dsz- mutant GPE-362 was transformed with plasmids 
5 pTOXI-1 and pTOXI-2. As a control, GPE-362 was also 

transformed with vector pRR-6. Transf ormants containing 
plasmid DNA were selected on RM plus chloramphenicol 
plates. Cm R colonies were transferred to BSM agarose 
plates supplemented with DBT. After 24 hr. incubation at 
10 30 °C, 2-HBP production could be seen around colonies 
containing either pTOXI-1 or pTOXI-2 by short wave UV 
illumination. No 2-HBP could be detected around colonies • 
containing only vector pRR-6. 

Overexpression of the Dsz* Trait Upon Reintroduction of 
15 Cloned DNA 

Plasmids pTOXI-1 and pTOXI-2 were transformed into 
Dsz- mutant strain CPE-648. Transf ormants containing 
plasmid DNA were selected on RM plus chloramphenicol 
plates. The specific activity of individual clones was 

20 examined by the following protocol. 

Single colonies of CPE-648 containing either pTOXI-1 
or pTOXI-2 were used to inoculate 25 ml RM plus 25 fig/ml 
chloramphenicol in a 250 ml flask. As a positive control, 
parent strain IGTS8 was also grown in 25 ml RM. After 48 

25 hours of growth at 30 *C, 225 rpm shaking, 2.5 ml of the 

cultures were crossed into 25 ml BSM supplemented with 0.7 
mM DMSO. Cultures were incubated for an additional 4 0 
hours at 30*C. The optical density of each culture was 
measured at 600 nm against an appropriate blank. DBT was 

3 0 added to a final concentration of 150 ^M and the cultures 
were incubated for 3 hours at 30°C. An equal volume of 
Reagent Alcohol (Baxter, McGaw Park, IL) was then added to 
each culture to solubilize any remaining DBT or 2-HBP. A 
1 ml sample was removed and cellular debris removed by 
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centrifugation. The supernatant was analyzed for 2-HBP by 
the HPLC assay described above. The specific activity is 
calculated as mg of 2-HBP per liter/hours of incuba- 
tipn/OD 600 . The results of the above assay is listed in 
5 Table l. 

TABLE 1: Biocatalytic Desulf urization Activity of Trans- 
formed Mutants 



STRAIN 


OD 600 


2-HBP 
(mg/1) 


Specific Activity 
(mg/l/hr/OD 600 


IGTS8 


2.89 


3.94 


0.45 


GPE-362 \ 


1.53 


0.00 


0.00 


CPE-648 


4.10 


0.00 


0. 00 


CPE648 (pTOXI-1) 


3.84 


15.84 


1. 37 


CPE648 (pTOXI-2) . 


2.88 


5.74 


0.66 



EXAMPLE DNA SEQUENCING OF A DESULFURIZATION ACTIVE 

BIOCATALYST BY THE DIDEOXY METHOD FROM PLASMID 
PTOXI-1 

Materials 

Bacter ial strains and plasmids 

Plasmid pTOXI-l was used as the original source of 
DNA for sequencing. Escherichia coli strain JM109 was 
used as a host for subcloning and plasmid maintenance. 
Plasmids puci8 and pUC19 were purchased from Bethesda 
Research Laboratories (Bethesda, MD) . 

Enzvmes and Reagents 

Restriction endonucleases were purchased from 
Bethesda Research Laboratories (BRL) and New England Bio- 
labs (Beverly, MA) . T4 ligase was purchased from BRL. A 
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Sequenase Version 2.0 DNA sequencing kit was purchased 
from United States Biochemical Corporation (Cleveland, 
OH) . All enzymes and kits were used in accordance with 
manufacturer's recommendations. 

5 Growth Media and conditions 

E* coli strain JM109 harboring plasmids was grown in 
L-broth (Difco) containing 100 ng/ml ampicillin. . Trans- 
formants were selected on L-plates supplemented with 1.5% 
agar and containing 100 jig/ml ampicillin. E. coli strains 
10 were grown at 37 °c. 

Methods 

Plasmid DNA preparation from E. coli 

Plasmid DNA was prepared from E. coli via lysis by 
SDS (Maniatis, et ^TJ . The DNA was further purified 
15 through a polyethylene glycol precipitation before use in 
sequencing reactions. 

Plasmid Subclonina 

The following subclones of pTOXI-1 were generated by 
standard techniques to aid in DNA sequencing: 

a) pMELV-l (Figure 5) was derived by isolating the 
6.7kb iiinddlll/Hdel fragment from pTOXI-1 (shown in Figure 
4) and ligating it to pUC-18 cut with Hindlll/Ndel . JM109 
cells harboring pMELV-l were identified by plasmid iso- 
lation and restriction endonuclease analysis (Maniatis, et 
aL ) . • 

b) pSMELV-lA (Figure 6) contains the 1.6kb Sohl/Xhol 
fragment of pMELV-l subcloned into pUC-18. 

c) pSMELV-2A (Figure 6) contains the 0.7kb BamHI/SacI 
fragment of pMELV-l subcloned into pUC-18. 

d) pSMELV-3A (Figure 6) contains the 3 . 5kb Sacl/Xhol 
fragment of pMELV-l subcloned into pUC-18 . 

e) pSMELV-4A (Figure 6) contains the 1.5kb Sphl/BamHI 
fragment of pMELV-l subcloned into pUC-18. 



25 



WO 94/01563 PCT/US93/06497 



-30- 

Dideoxv Sequencing from Plasmid DNA 

a) Denaturation. Prior to sequencing reactions, 
plasmid DNA must be denatured. This was accomplished by 
treatment with NaOH. The denatured DNA is then recovered 
5 by addition of salt and EtOH precipitation. Preferably, 
2-5 ng of denatured plasmid DNA is used in each sequencing 
reaction. See manufacturer's recommendations with Se- 
quenase Version 2.0 DNA sequencing kit (United States 
Biochemical Corporation) . 

10 b) Dideoxy sequencing. Chain termination dideoxy 

sequencing with Sequenase 2.0 was performed as described 
by the manufacturer (U.S. Biochemical Corporation). 
Sequencing of the cluster was initiated by priming sub- 
clones pMELV-lA, pMELV-2A, pMELV-3A, pMELV-4A with the 

15 4 0 Universal Primer" defined as: 

5 1 -GTTTTCCCAGTCACGAC-3 1 and the "Reverse Primer" defined 
as: 5 ' -AACAGCTATGACCATG-3 1 . The sequence was extended by 
synthesizing overlapping oligonucleotides to previously 
read sequence using the Gene Assembler Plus (Pharmacia, 

20 Piscataway, NJ) . The synthesized oligonucleotides were 
used as primers for continuing sequence reactions. Plas- 
mid pMELV-l was used as the template for all of the re- 
maining sequences. DNA sequence was read from both 
strands of the plasmid clone to increase fidelity, 

25 EXAMPLE 3: COMPLEMENTATION CLONING OF A DESULFURI 2 ATION 
ACTIVE BIOCATALYST FROM A COSMID LIBRARY; 
TRANSFECTION OF BIOCATALYST DNA INTO AN 
R. FASCIANS HOST ORGANISM 

Materials and Methods 



30 Bacterial strains, media and reagents 

Rhodococcus sd. Rhodococcus rhodochrous strain IGTS8 , 
obtained from the Institute of Gas Technology (Chicago, 
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IL) was used, uvi is a mutant of IC-TS8 that is unable to 
desulfurize DBT, described herein. R. fascians D188-5 
(Desomer, et al^, J . Bacteriol . . 170:2401-2405, 1988) and 
R. rhodocfrrous ATCC13808 (type strain from ATCC) do not 
5 metabolize DBT. E. cojLi XLl-Blue (from Stratagene Cloning 
System, La Jolla, CA) is recAl la c thi endAl avrA96 hsdRl7 
sypE44 re3,Al fF' DroAB lacTS lacZ4M15 Tnltl ] . E. coli 
CS109 is W1485 thi supE F' . E. coli S17-1 is a derivative 
of E . colj, 294 and is recA thi oro hsdR i res - mod i [R P4 - 2 - 
10 Tc: :Mu-Km: :Tn2] (Simon, g£ aj^., Plasmid vectors for the 
genetic analysis and manipulation of rhizobia and other 
gram-negative bacteria, p. 640-659. In A. Weissbach, and 
H. Weissbach (eds.), Methods in enzymology, vol 118, 
Academic Press, Inc., Orlando, 1986). 
15 Pseudomonas minimal salts medium (PMS) was prepared 

according to Giurard and Snell (Biochemical factors in 
growth, p. 79-111. In p. Gerhardt, R. g. E. Murray, R. n. 
Costilow, E. W. Nester, W. A. Wood, N. R. Krieg, and G. B. 
Phillips (eds.), Manual of methods for general bacteriol- 
20 ogy, American Society for Microbiology, Washington, DC, 
1981) and contained 0.2% glycerol, 40 mM phosphate buffer 
(PH 6.8), 2% Hutner's mineral base, and 0.1% (NH 6 ) 2 so 4 . 
PMS medium lacking sulfate was prepared with chloride 
salts in place of sulfate salts. Luria broth (LB) was 1% 
25 bactotryptone , 0.5% yeast extract, and 1% NaCl. All 

liquid medium incubations were performed with shaking in 
water baths (New Brunswick Scientific, Edison, NJ) . 
Ampicillin (50 fig/ml) and tetracycline (12.5 pg/ml) were 
included as selective agents when required. Dibenzothio- 
3 0 phene (DBT) was purchased from Fluka Chemical Corporation 
of Ronkonkoma, NY. DBT-sulf oxide was from ICN Bio- 
chemicals of Irvine, CA, and DBT-sulfone was obtained from 
Aldrich Chemical Company of Milwaukee, WI. Agarose was 
obtained from BRL. 
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Plasmid vectors 

pLAFRS (Keen, et aL, Gene 20:191-197, 1988) and 
pRF29 (Desomer, et al . , 1988) served as sources of the 
Rhodococcus plasmid origin of replication. 

5 Cosmid library construction 

High molecular weight DNA was isolated from I GTS 8 by 
the method of Consevage et sJL, ( J. Bacteriol . . 162 : 138- 
146, 1985), except that cell lysis was accomplished in TE 
(10 mM Tris-HCl, 1 mM EDTA, pH 8.0) containing lysozyme (5 

10 mg/ml) and SDS (2%) . The DNA was partially digested with 
Sau3AI and fragments of approximately 20 kb were isolated 
after centrifugation through a sodium chloride gradient 
(Frischauf, et al . . Digestion of DNA: size fractionation , 
p. 183-189. la S. L. Berge, and A. R. Kimmel (eds.), 

15 Methods in Enzymology, vol 152, Academic Press, Inc, San 
Diego, CA, 1987). These fragments were ligated into the 
Bam HI site of pLAFRS using standard procedures. In vitro 
packaging was performed using Gigapack Plus (Stratagene) . 
Packaged cosmids were transduced into E. coli S17-1. 

20 DBT sprav plate assay 

A spray plate assay for the identification of bac- 
teria capable of modifying dibenzothiophene (DBT) was 
originally described by Kiyohara et al, ( AppI . Environ. 
Microbiol. , 4J3: 454-457, 1982) and modified by Krawiec 

25 (Bacterial desulfurization of thiophenes: screening tech- 
niques and some speculations regarding the biochemical and 
genetic bases, p. 103-114. In G. E. Pierce (ed.), Develop- 
ments in Industrial Microbiology, vol 31, Society for 
Industrial Microbiology, Columbus, Ohio, 1990). The assay 

30 was further modified for use with R. rhodochrous IGTS8 as 
follows. Cells from individual IGTS8 colonies were trans- 
ferred to LB plates as small (0.5 cm) patches and were 
incubated at 30°C for 24 to 36 h. Large amounts of cells 
from these patches were transferred onto PMS-1% agarose 
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plates that lacked a source of sulfur. These. plates were 
immediately sprayed with a 0.1% DBT solution in ethyl 
ether. The PMS-DBT plates were incubated at 30 °C for a 
minimum of 18 hours and fluorescent products around the 
5 patches were detected by viewing under short-wave (254 nm) 
UV illumination. 

Sulfur bioavailability assay 

I GTS 8 was incubated in PMS medium at 30 *C for 2 4 to 
48 h, the. cells were pelleted by centrifugation, followed 

10 by two washes with sulfur-free PMS . Washed cells were 
inoculated into PMS that contained, as a sole source of 
sulfur, a 0.2% concentration of one of the following: DBT, 
DBT-sulf oxide, or DBT-sulfone. The inoculum was adjusted 
so that the beginning absorbance at 600 nm (A 600 ) was 0.02. 

15 The culture was incubated at 30 'C and growth was monitored 
at A 600- For cultures incubated with DBT, the supernatant 
was viewed at various intervals under short wave UV light 
to check for production of fluorescent products. 

Plasmid iso lation and hybridizations 

20 Cosmid DNA (pLAFR5) was isolated from E. coli as 

described by Ish-Horowicz and Burke ( Nucl. Acids Res. . 
9:2989-2998, 1981), and from Rhodococcus species as des- 
cribed by Singer and Finnerty ( J . Bacteriol . r 170:638-645, 
1988). Large scale cosmid preparations were carried out 

25 according to Birnboim and Doly ( Nucl. Acids Res, . 7 : 1413- 
1423, 1979). DNA hybridization experiments were performed 
according to Southern ( J. Molec. Bio] . r 98:503-517, 1975). 
DNA was labelled with 32 P-dCTP (Amersham) , using the ran- 
dom primer method of Feinberg and Vogelstein ( Anal. Bio- 

30 chem. . 137 : 266-267 , 1984 ) . 



UV mutagenesis of IGTS8 - 

IGTS8 was incubated overnight in LB at 30° C and 
approximately 3000 colony forming units were spread onto 
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fresh LB plates. These plates were immediately exposed to 
short wave UV light (254 nm) for 5 to 20 s at a distance 
of 3.5 cm. Plates were incubated at 30*C for 48 h or 
until colonies developed. Colonies from plates exhibiting 
5 >50% cell death were assayed for their ability to meta- 
bolize or desulfurize DBT, using the spray plate assay. 

Electrotransf ormation of Rhodococcus 

R. rhodochrous IGTS8 and the UV1 mutant were trans- 
formed with plasmid DNA via electroporation (Gene Pulser, 

10 Biorad Laboratories, Inc, Hercules, CA) . The bacteria 

were grown in LB for 24 to 48 h at 30'C, diluted to an A 600 
of 0.15 with fresh LB, and incubated at 30 °C for an ad- 
ditional 4 h. Cells were collected by centrifugation and 
washed four to five times with 0.3 M sucrose and finally 

15 resuspended to -5 x 10 9 cells/ml in 0,5 M sucrose. To an 
ice cold 0.2 cm electroporation cuvette (Biorad), was 
added 40 yl of this bacterial solution. The cells were 
pulsed at 25 pF and 2.5 kV with the Pulse Controller at 
800 ohms and were immediately diluted with 1 ml- of LB 

2 0 containing 0.5 M sucrose. The cells were incubated at 

30 °C for 1 h, plated on LB agar plates plus appropriate 
antibiotics, and incubated at 30 6 C until colonies de- 
veloped. When the plasmid carried the pRF29 Rhodococcus 
plasmid origin of replication, colonies were visible after 
25 48 h. In the absence of the pRF29 origin, colonies ap- 
peared after 4 to 5 days. 

R. fascians D188-5 was transformed by electroporation 
in a similar manner but, due to its slower growth rate, it 
was incubated in LB overnight until it reached an A 600 of 

3 0 -2.0. The cells were washed and resuspended in distilled 

water instead of sucrose. The Pulse Controller was set at 
4 00 ohms and the recovery period after electroporation was 
in LB for 4 h before plating onto selective media. Suc- 
cessful transformation of R. fascians D188-5 with E. coli 
35 plasmids required that the DNA be methylated in vitro 
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10 



beforehand, using the CpG methylase, Sss I (New England 
Biolabs, Inc., Beverly, MA). 

Gas chromatogra phy and mass spectroscopy 

Cells were incubated overnight in LB medium at 30 °C 
and 100 jil was used to inoculate 50 ml of PMS minimal 
medium. The culture was incubated at 3 0 *C for 4 days, 
washed twice with sulfur-free PMS and the pelleted cells 
were inoculated into 50 ml of PMS that contained 0.1% DBT 
as the sole source of sulfur. These cells were incubated 
at 30 *C for 24 h and the supernatant was stored frozen at 
-20 'C. For assays involving R. fascians D188-5, incu- 
bation times were increased 2 to 3-fold. 

Sample preparation and chemical analyses were per- 
formed as described (Olson, a* a^, Energy & f u «i« . S ub- 
15 aitted, 1993). Briefly, each sample supernatant (-50 ml) 
was thawed and residual insoluble material was removed by 
centrifugation. The cleared supernatant was acidified 
with HC1 to pH 1.0 and then extracted three times with 50 
ml of ethyl acetate. Insoluble material from the centri- 
fugation step was also extracted with ethyl acetate. The 
ethyl acetate extracts were combined, dried over anhydrous 
calcium chloride, filtered, and ethyl acetate was removed 
by rotary evaporation. A known amount of internal stan- 
dard (octadecane in chloroform solution) was added to the 
sample, which was then analyzed by GC/FID (gas chromato- 
graphy/ flame ionization detection) and GC/FTIR/MS (gas 
chromatography/ Fourier transform infrared/mass spectro- 
metry) . m some samples, the acidic components in the 
ethyl acetate extract or in the post-extraction aqueous 
layer were methylated by treating with an ether solution 
of diazomethane . 

The analyses were performed on a serially interfaced 
GC/FTIR/MS system as previously described (Diehl, et al. . 
Spectros. Int. j. , 8; 43-72, 1990, Olson and Diehl, Anal. 
35 Chesu, 19:443-448, 1987). This system consisted of the 



20 



25 



30 
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Finnegan ion trap (ITD 800) operated with the AGC on and 
the Nicolet 20SXB Fourier transform infrared spectrometer. 
Gas chromatography was conducted with a 30 m x 0.32 mm DBS 
column (1.0 urn phase thickness) with a 2.0 ml/min helium 
5 carrier flow rate measured at 330 *C. On-column injections 
were utilized for sample introduction because the sulf- 
oxides and sulfones are thermally unstable and they decom- 
pose in split or splitless injectors (Vignier, £t al . . J . 
High Resol. Chromatoor. & Chromatoor. Commun. . 6:661-665, 

10 1983) . The oven temperature program was as follows: 40 'C 
injection, followed by increases in temperature at rates 
of 20°C/min to 80°C, 5°C/min to 200°C, 10*C/min to 330°C, 
and hold for 5 min. GC/FID analyses were performed with a 
HP 5880A with a similar column and program for flow rate 

15 and oven temperature. 

Results 

Isolation of a Dsz- Mutant of R. rhodochrous IGTS8 

When cloning from a foreign bacterial genus into E. 
coli ; not all genes are expressed nor are all protein 

20 products active. To assure that cloned desulfurization 
genes would be expressed in the host cell, a mutant of R. 
rhodochrous IGTS8 that could no longer desulfurize DBT was 
isolated. Using this mutant as a cloning recipient would 
insure that the cellular environment was appropriate for 

25 gene expression and protein function, thereby allowing 
screening for cloned desulfurization genes by complemen- 
tation. 

R. rhodochrous IGTS8 was mutagenized by exposure to 
UV light, and 1000 survivors were screened for the ability 

3 0 to produce a UV fluorescent product in the DBT spray plate 
assay. Three potential desulfurization negative mutants 
were identified and then re-evaluated in the sulfur bio- 
availability assay. Two mutants (designated UV1 and UV23) 
could not use DBT or DBT-sulfone as sole sources of sulfur 

35 and thus appeared to be Dsz-. When grown in the presence 
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of DBT, mutant UVl could not metabolize DBT to 2-HBP or to 
any other potential intermediate, as measured by GC/MS 
analysis. Therefore, strain UV1 was considered to be Dsz- 
and was used as the host for complementation studies to 
5 identify clones that carried desulfurization genes. 

Cosmid cloning of desulfurization genes 

DNA from Dsz+ source organism IGTS8 was used to 
construct a library in the cosmid vector, pLAFRS . This 
library was transduced into E. coli S17-1 and plasmids 

10 were isolated from approximately 25,000 colonies. These 
cosmids were electroporated into R. rhodochrous UV1, a 
Dsz- mutant of IGTS8, with an efficiency of -300 trans- 
formants/iig DNA. Various numbers of UVl transf ormants 
were pooled and incubated for 18 hours at 30 'C, after 

15 which the cells were washed twice and resuspended in 

sulfate-free PMS . Approximately 7 x 10 8 pooled cells were 
inoculated into 100 ml of PMS with DBT as the sole source 
of sulfur. A predicted product. of the DBT desulfurization 
reaction is 2-HBP, which is fluorescent when exposed to UV 

20 light. Therefore, batch cultures were grown at 30 °C and 
the supernatants were observed for fluorescence. Approxi- 
mately 33 00 UVl transf ormants were screened in four sepa- 
rate batches. In one batch (representing -600 transform- 
ants) a UV fluorescent product appeared in the supernatant 

25 after five days' incubation. Individual colonies were 

isolated and twelve of these continued to produce a fluor- 
escent product when exposed to DBT. 

Attempts to recover cosmid DNA from these isolates 
failed, so Southern hybridizations were performed to 

30 determine if the cosmids had become integrated into the 
chromosome of strain UVl. Chromosomal DNA was isolated 
from seven transf ormants and digested with EcoRI. After 
agarose electrophoresis and blotting, the fragments were 
hybridized with 32 P-labelled probes derived from pLAFRS. 

35 In all transformants tested, pLAFRS probes hybridized to a 
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DNA fragment -20 kb in size. Vector derived probes did 
not hybridize to the control I GTS 8 genome. Therefore, the 
desulfurization positive cosmid clones had apparently 
integrated into the chromosome of strain UV1. 
5 Since the plasmids had integrated into the chromo- 

some, the genomic DNA connected to either side of the 
plasmid cloning site must represent R. rhodochrous I GTS 8 
sequences that were able to complement the Dsz- mutation 
in strain UV1. (This would be true regardless of whether 

10 the mode of integration was by homologous or illegitimate 
recombination.) Sequences were recovered that flanked the 
inserted plasmid from three desulfurization positive 
* transformants by digesting genomic preparations with Eco RI . 
or BamHI. These enzymes cut pLAFRS once in the polylinker 

15 region so that an intact sequence of pLAFRS could be 

recovered, linked to a neighboring chromosomal fragment 
from IGTS8. The digested DNA was ligated to itself (at a 
concentration of -20 ng/yl) and was transformed into E. 
coii S17-1. Sixteen tetracycline resistant colonies were 

2 0 obtained, seven from the Bam HI digestion and nine from the 

EcoRI digestion. Restriction enzyme analysis revealed 
that all the EcoRI-rescued clones contained a 2,1 kb 
fragment of IGTS8 DNA. The Bam HI-rescued clones contained 
a 1.65 kb fragment from IGTS8. 

25 The 2.1 kb IGTS8 DNA from the £coRI rescue experiment 

was used as a template to make labelled DNA probes, which 
were hybridized to colony lifts of the original, intact 
cosmid library in E. coli . Of 5000 colonies, 17 hybri- 
dized with the I GTS 8 probes. Cosmid DNA was isolated from 

30 each clone and transformed into strain UV1. Three of the 
seventeen DNA preparations complemented the Dsz- pheno- 
type. 

A restriction map for this region was constructed, 
using EcoRI and Hindi II. Probes from the 2 . 1 kb IGTS8 DNA 

3 5 hybridized to the 4.5 kb Eco RI fragment. All cosmid 

clones that conferred the Dsz+ phenotype contained the 
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entire 4.5 kb EcoRI fragment and portions of the 4.5 kb 
££oRI-HindIII and 18 kb Eco RI fragments. These results 
indicated that the desulfurization genes lay within a 15 
kb region. 

5 Subclonina the desulfurization genes 

The 4.5 kb £c£RI and the 4.5 kb EcoRl-ffindlll frag- 
ments were subcloned into pLAFRS , but neither fragment 
complemented the Dsz- mutation of strain UVl. The 9.0 kb 
EcoRI fragment from GE1-H, the 15.0 kb EcoRI -Hindi IT 

10 fragment from GE1-C, and the 18 kb EcoRI fragment from 
GE1-K were subcloned into pLAFRS to yield the plasmids 
pSAD60-28, pSAD48-12, and pSAD56-6, respectively. When 
transformed into UVl, all three produced UV fluorescent 
products from DBT in the spray plate assay, consistent 

15 with the localization of the Dsz+ phenotype as determined 
by restriction mapping. Construction of additional sub- 
clones from this region narrowed the location of the 
relevant genes to a 6.5 kb BstBI fragment. 

Nature of the m utation in strain UVl 
2 0 Genomic blots of EcoRI digested I GTS 8 and UVl DNA 

were hybridized with probes produced from the 2 . 1 kb 
EcoRI-rescued fragment of IGTS8. No hybridization was 
detected to UVl DNA, indicating that the UVl mutation is a 
large deletion and not a simple point mutation. 

25 A Rhodococcus p lasmid origin of replication increases 
transformation of UVl 

Electroporation of UVl with pSAD48-12 typically 
resulted in a low transformation efficiency (-550/ng DNA) 
and only about 50% of the transformants exhibited the Dsz+ 

30 phenotype (presumably because DNA had been lost or re- 
arranged during recombination with the chromosome) . To 
improve the transformation efficiency, a 4.5 kb Hindi I I 
fragment from pRF29 was cloned into the Hindlll site of 
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pSAD48-12, resulting in pSAD74-12. This 4.5 kb fragment 
contains a Rhodococcus plasmid origin of replication, 
which allowed pSAD74-12 to replicate as a plasmid in 
strain UV1. This clone transformed UV1 with an efficiency 
5 of greater than 10 4 transf ormants/pg DNA. Nearly 100% of 
these transf ormants exhibited the Dsz+ phenotype. Unfor- 
tunately, the yield of plasmid prepared directly from UVl 
was so poor that DNA from minipreparations could not be 
visualized on agarose gels. However, plasmid isolated 
10 from UVl could be used to transform E. coli S17-1, from 
which large amounts of the plasmid were prepared. 

The Dsz+ phenotype is not expressed in E. coli S17-1 

E. coli S17-1 was transformed with pSAD48-12 and 
desulfurization activity was measured with the spray plate 
15 assay. No positive colonies were identified. It was 

possible that the E. coli polymerase could not recognize 
the IGTS8 promoter (s) in pSAD48-12, so the IGTS8 DNA was 
placed under control of the E. coli lac promoter. The 15 
kb EcoRI-Hindlll IGTS8 fragment from pSAD4 8-12 was sub- 

2 0 cloned into the pBluescript vectors, SK* and KS*, so that 

the I GTS 8 fragment was cloned in both orientations with 
respect to the lac promoter. Neither clone expressed the 
Dsz+ phenotype in E. coli XLl-Blue. It is not yet known 
whether this stems from poor transcription or translation 
25 of the cloned genes or whether the overproduced proteins 
are inactive in E. coli S17-1. 

The Dsz+ aene or genes are expressed in R. fascians 

Since the cloned genes were either not expressed or 
produced inactive proteins in E. coli . efforts were initi- 

3 0 ated to express the genes in other Rhodococcus species. 

R. fascians D188-5 exhibited no desulfurization in the DBT 
spray plate assay or in the sulfur bioavailability assay. 
Initial attempts to transform R. fascians with the desul- 
furization positive plasmid, pSAD74-12 were unsuccessful. 
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Other Rhodococcus species are known to have endogenous 
restriction systems that cleave DNA at Sail-like restric- 
tion sites. Since pSAD74-12 contained multiple Sai l 
recognition sequences, CpG methylase, Sss l. was used to 
5 methylate pSAD74-12 in vitro , with methylated pSAD74-12 
DNA, transformants of R. faseians D188-5 were obtained 
with an efficiency of about 7 x 10 3 transformants/pg DNA. 
These transformants displayed the Dsz+ phenotype in the 
spray plate assay and GC analysis of liquid medium super- 

10 natant revealed the formation of 2-HBP from DBT. 

Efforts to transform pSAD74-12 into a second species, 
R. rhodorhrnns ATCC13808 were ineffective, despite the use 
of unmethylated or CpG-methylated plasmid. It is possible 
that the electroporation conditions for ATCC13808 were not 

15 optimal, though a wide range of conditions was tested. it 
seems more likely that ATCC13808 has a restriction system 
that is not inhibited by CpG methylation. 

2-HBP is the ma jor desulfurization product 

The predominant metabolite produced from DBT by R^. 

20 rhodochrous IGTS8 is 2-HBP, with small amounts of 2'- 
hydroxybiphenyl-2-sulfinic acid (DBT-sultine) and 2'- 
hydroxybiphenyl-2-sulfonic acid (DBT-sultone) also identi- 
fied by GC/MS analysis (Olson, e£ aj_., Energy ft Fnp] ? i n 
press, 1993). These products were also produced by IGTS8 

25 in this work (Table 2). Neither R. faseians D188-5 nor R^. 
rhodochrous Dsz- mutant UV1 produced these products from 
DBT. However, when R. faseians D188-5 was transformed 
with plasmid pSAD74-l2 and when the R. rhodochrnns uvi 
mutant was transformed with plasmid pSAD104-l0, these 

30 bacteria produced products from DBT that were identical to 
those identified for R. rhodochrnns I GTS 8 (Table 2) . In 
particular, 2-HBP was produced in large quantities, indi- 
cating that carbon-sulfur bond specific desulfurization of 
DBT was mediated by products of genes cloned from I GTS 8 . 
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One subclone, pSAD90-ll, carried a DNA fragment that 
was supposedly identical to that cloned into pSAD104-10, 
but the two plasxtiids differed in the results they produced 
when introduced into R. rhodococcus UVl. In the plate 
5 assay, the surface film of DBT disappeared from the vicin- 
ity of colonies that contained pSAD104-10, producing a 
clear zone, and a fluorescent halo appeared around those 
colonies. On the other hand, when cells contained pSAD90- 
11, no fluorescent products were produced but a zone of 

10 DBT clearing did form around each colony. GC/MS analysis 
showed that no 2-HBP was produced by cells containing 
pSAD90-ll, but that a significant amount of DBT-sultone 
did accumulate (Table 2). The sultone does not accumulate, 
in the parent strain, UVl (data not shown). These obser- 

15 vations imply that when the 9.0 kb Eco RI fragment was 
subcloned into pSAD90-ll the DNA was damaged so as to 
inactivate the gene(s) encoding .the enzyme (s) that convert 
the sultone to 2-HBP. This suggests that at least two 
enzymes are involved in desulfurization and that the 

2 0 sultone may be an intermediate in the pathway. This 
result is consistent with the kinds of metabolites de- 
tected in the original isolate, R. rhodochrous IGTS8 
(Olson, et al., 1993). 
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phenyl-2-sulf onic acid (detected as dibenz [c, e] [1, 2 ]- 
oxathiin 6, 6-dioxide) ; DBT-sultine, 2 1 -hydroxybi- 
phenyl-2-sulf inic acid (detected as dibenz[c,e] [1,2]- 
oxathiin 6-oxide) ; dibenzothiophene sulfone; 2-HBP, 2- 
5 hydroxybiphenyl (Krawiec, pg. 103-114. In G. £ . Pierce 

(ed.)# Developments in Industrial Microbiology, vol 
31, Society for Industrial Microbiology, Columbus, 
Ohio, 1990). 

b 9.0 kb Eco RI DNA fragment from IGTS8 subcloned into 
10 pLAFRS , plus the origin of replication from pRF2 9. 

c Mutated 9,0 kb EcoRI DNA fragment from IGTS8 subcloned 
into pLAFRS, plus the origin of replication from 
pRF29. 

d 15.0 kb EcoRI -Hindi I I DNA fragment from IGTS8 sub- 
15 cloned into pLAFRS, plus the origin of replication 

from pRF29. 

e Presence of metabolites is reported in relative 

amounts from very large amounts (+++++) to very small 
(+) , i.e., trace amounts. 

2 0 IGTS8 cannot use DBT-sulf oxide as a sulfur source 

R . rhodochrous IGTS8 was incubated in minimal medium 
with one of the following as the sole source of sulfur: 
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DBT, DBT-sulfoxide, or DBT-sulfone. IGTS8 was incapable of 
utilizing the sulfur supplied by DBT-sulfoxide but grew 
well in the presence of DBT or DBT-sulfone. DBT-sulfoxide 
was not toxic to cells when grown in a rich medium (LB) . 
5 Therefore, either IGTS8 cannot transport or otherwise act 
on DBT-sulfoxide, or else DBT-sulfoxide is not a true 
intermediate of the desulfurization pathway. 

EXAMPLE 4: ' DNA SEQUENCING OF A 9763 NUCLEOTIDE ECORI- 

SAU3AI FRAGMENT CONTAINING THE GENE OR GENES ■ 
10 FOR THE DESULFURIZATION BIOCATALYST OF I GTS 8 BY 

THE METHOD OF SANGER ET AL. 
A 9763 nucleotide EcoRI-££u,3AI fragment containing the 
gene or genes responsible for the Dsz+ phenotype was iso- 
lated from the I GTS 8 source organism. The DNA sequence of 
15 this fragment was determined from both strands of DNA using 
the dideoxy chain-termination method of Sanger et al . 
(1977), DNA sequencing wi th chain-termination inhi hi t.nr<; , 
74 PROC. NATL. ACAD. SCI. USA 5463-5467, a modified T7 DNA 
polymerase (USB) and [o- 35 S]-dCTP (Amersham) . Deletion 
20 clones for DNA sequencing were constructed in pBluescript 
(Stratagene) using exonuclease III and the methods of 
Henikoff (1984), Unidirect ional digestion with exnniirUaco 
III creates targeted brea kpoints for DNA sequencing, 28 
GENE 351-359. 

25 Sequences from 141 individual deletion clones were 

used to reconstruct the entire 9763 nucleotide fragment. 
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Computerized sequence assembly was performed using DNA 
Inspectorll (Textco, Hanover, NH) . The DNA sequence was 
determined independently for each strand of DNA/ but the 
entire 9763 nucleotide fragment was not completely se- 
5 guenced on both strands. The sequence determined from one 
strand of DNA covered 95% of the 9763 nucleotide sequence. 
On the other DNA strand, 96% of the sequence was deter- 
mined. The sequence was determined from at least two 
independent deletion clones for the entire 9763 nucleotide 
10 fragment. 

EXAMPLE 5: FURTHER RESOLUTION OF THE SEQUENCE. OF PTOXI-1 

AND OPEN READING FRAMES (ORFS) ENCODED THEREIN? 
DSZ+ PROMOTER ENGINEERING; EXPRESSION OF THE 
DSZ+ PHENOTYPE. IN A HETEROLOGOUS HOST ORGANISM; 
15 MAXICELL ANALYSIS OF DESULFURIZATION GENE 

EXPRESSION PRODUCTS 
Organization of the desulfurization cluster 

Sequencing of pTOXI-1, the results of which are set 
forth below in the Sequence Listing, predicted three nearly 
2 0 contiguous open reading frames (ORFs) on one strand of the 
clone (Figure 7) . The sizes of each ORF are predicated as 
1359 bases (bps 786-2144) for ORF 1, 1095 bases (bps 2144- 
3238) for ORF 2 and 1251 bases (bps 3252-4502) for ORF 3. 
Subclone analysis described below has revealed that ORFs 1, 
2 5 2 and 3 are required for the conversion of DBT to 2-HBP and 
that all of the genes encoded by these ORFs are transcribed 
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on a single transcript as an operon. All subclones des- 
cribed below are maintained in E. coli - Rhodococcus shut- 
tle vector pRR-6. Activity of each subclone was determined 
by growing transformants of Dsz- strain CPE-648 in a rich 
5 media (RM) for 48 hours. 1 ml of the culture was used to 
inoculate 2 5 ml BSM supplemented with greater than 100 \xK 
DBT or DBT-sulfone. Cultures were assayed for desulfuri- 
zation products after 48 - 120 hours. A diagram of each of 
the subcloned fragments is shown in Figure 8. 
1° In subsequent studies, the subclones were grown in 

rich media with chloramphenicol, then crossed into BSM 
supplemented with 100 jiM of either DBT or DBT-sulfone. 
Cultures were shaken at 30oc for 2-5 days and assayed for 
desulfurization products by HPLC 

15 A. pENOK-1: A. subclone was constructed which contains 
the 4.0 kb §£hl fragment of pTOXI-l. This fragment spans 
ORFs 1 and 2 but truncates ORF 3. Analysis of pENOK-1 
containing transformants revealed the production of no 
products when incubated with DBT. However these trans- 

2 0 formants were capable of producing 2 -HBP from DBT-sulfone. 

B. pENOK-2: A suclone which contains the 3,6 kb SacI 
fragment of pTOXI-1 was constructed. This fragment con- 
tains ORFs 2 and 3 but truncates ORF 1. Analysis of pENOK- 
2 transformants revealed no production of any desulfuri- 
25 zation products from either DBT or DBT-sulfone. The lack 
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of any activity detectable from either ORFs 2 or 3 suggests 
that the ORFs are arranged as an operon with transcription 
mediated from a single upstream promoter. Presumable, this 
promoter has been removed in this subclone. 

5 C. pENOK-3: A 1.1 kb Xho l deletion mutation of pTOXI-1 
was constructed. Both ORFs 1 and 2 are truncated. ORF 3 
remains intact. Transf ormants harboring pENOK-3 show 
production of DBT-sulfone from DBT. No production of 2-HBP 
is detected from either DBT or DBT-sulfone. It should also 

10 be noted that at the nucleotide level, a deletion of this 
type would not result in a polar mutation. The sequence 
predicts an in-frame splicing of ORFs 1 and 2 which would 
produce a hybrid protein that is presumably inactive. 
However, by avoiding stop codons, the putative single mRNA 

15 transcript remains protected by ribosomes allowing for 

translation of ORF 3. The ability of the ORF-3 product to 
produce DBT-sulfone from DBT demonstrates that DBT-sulfone 
is a true intermediate in the carbon-sulfur bond specific 
biocatalytic desulfurization pathway of IGTS8. 

20 D. pENOK-11: The 3.4 kb Nc^I fragment from pTOXI-l was 
subcloned into a unique Ncol site of pRR-6. This fragment 
contains all of ORFs 2 and 3 but truncates the 5 1 end of 
0RF1. Transf ormants with pENOK-11 demonstrated no desulfu- 
rizing-specif ic enzymatic activity towards DBT or DBT- 

25 sulfone. This indicates essential coding regions bordering 



WO 94/01563 



PCT/US93/06497 



-49- 

this fragment. This is consistent with the predication 
that the entire cluster is expressed on a single transcript 
as discussed for subclone pENOK-2. Again, the promoter for 
gene transcription is not present in this subclone. Sub- 
5 clone pENOK-13 (below) corroborates this prediction. 

E. pENOK-13: A subclone of pTOXI-1 was constructed which 
had a 2,6 kb Soh l- Xho l deletion. This subclone only con- 
tains an intact ORF 3. ORF 1 is lost completely and ORF 2 
is truncated. This subclone showed no desulfurizing-speci- 

10 fic enzymatic activity towards DBT or DBT-sulfone. This 
result should be compared with the phenotype of pENOK-3 
which demonstrated production of DBT-sulfone from DBT. 
Because pENOK-13 differs from pENOK-3 by the additional 
deletion of the smaller Sshl/Xhol fragment, this would 

15 indicate an element in the 1.6 kb Sohl/Xhol fragment which 
is essential for gene expression. Because sequencing has 
revealed no significant ORF's contained in this region, it 
is postulated that a promoter element may be present in 
this region. 

20 F. pENOK-16: A subclone of pTOXI-1 was designed which 

eliminates nearly all unnecessary sequences from the desul- 
furization cluster. This construct contains the 4 kb 
BstBI-SnaBI which presumably contains all essential se- 
quence for complete desulfurization in that in contains all 

25 of ORFs 1, 2 and 3 as well as 234 bases of upstream se- 
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quence. The 3 1 Sna BI site lies 80 base pairs beyond the 
termination of ORF 3. CPE-648 harboring this plasmid was 
capable of converting DBT and DBT-sulfone to 2 -HBP. pENOK- 
16 thus represents the smallest amount of the cluster yet 
5 observed which demonstrates the complete desulfurization 
phenotype. 

G. pENOK-18: This subclone contains a Nsi l- Bfa l fragment 
of pTOXI-1. The Nsi l site is 23 bp downstream of the 
predicted start site of ORF 1. CPE-64 8 harboring this 

10 subclone lacks desulfurization activity on both DBT and 
DBT-sulfone. This subclone most likely eliminates the 
promoter region and truncates the first structural gene. 

H. pENOK-Nsi: To help further elucidate the start site 
of ORF 1, a subclone was made in which a 4 bp deletion is 
introduced at the unique Nsi l site which is 2 3 bp down- 
stream of the predicted start site of ORF1. The mutation 
was generated by cutting with Nsi l and blunting the ends 
with T4 DNA Polymerase . If the Nsi l site is within the 
first, structural gene this frameshift mutation would cause 
an early stop signal in ORF 1. Transf ormants of pENOK-Nsi 
were capable of producing DBT-sulfone from DBT. However, 
no production of 2-HBP was detected indicating that the 
mutation had disrupted an essential structural gene. 

In subsequent studies, due to the clear expression of 
the ORF-3 encoded oxidase, in this clone, it was considered 



20 
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likely that the ORF-2 product would also be expressed* 
Accordingly, ORF-2 alone is incapable of further metabolism 
of DBT-sulfone. 

I. pENOK-19: A subclone of pTOXI-1 was constructed which 
5 contains a deletion from the Hqtl site, which is in the 
earlier part of ORF 2, to the SnaBI which is after ORF 3. 
This subclone should demonstrate the activity of ORF 1 
alone. CPE648 transf ormants harboring this subclone dis- 
played no enzymatic activity towards DBT or DBT-sulfone. 
10 The results of oENOK-Nsi and pENOK-19, taken together, 

suggest that the ORF-I and ORF-2 products must be simul- 
taneously expressed in order to further metabolize DBT-sul- 
fone. 

J. pENOK-20: In order to evaluate the function of ORFs 2 
15 and 3 separately from ORF 1, DNA spanning ORFs 2 and 3 was 
amplified by the Polymerase Chain Reaction (PCR) . Primers 
RAP-1 (5'-GCGAATTCCGCACCGAGTACC-3\ bps 2062-2082) and RAP- 
2 (5 1 -ATCCATATGCGCACTACGAATCC-3 1 bps 4908-4886) were syn- 
thesized with the Applied Biosystems 392 DNA/RNA Synthes- 
20 izer. Nucleotides in bold were altered from the template 
sequence in order to create restriction sites for subclon- 
ing; thus primer RAP-l contains an Eco RI site, and primer 
RAP-2 contains an Ndel site. Amplification was carried out 
with the GeneAmp Kit (Perkin Elmer Cetus) which utilizes 
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the Taq polymerase and the Perkin Elmer Cetus 9600 Thermo- 
cycler. Parameters were as follows: 

Template: pMELV-1 Plasmid DNA 0.2 or 2.0 ng 

Primers: RAP-1 . 0.5 or 0.2 pM 

5 RAP-2 0.5 or 0.2 \iM 

Cycles: 1X@ 96°C 2 min 

25X@ 96*C 30 sec 

52'C 30 sec 

72'C 2 min 

10 Amplification yielded the predicted 2846 bp fragment. 

In order to express the amplified fragment harboring ORFs 2 
and 3, it was ligated to the Xbal/ Eco RI fragment of the 
chloramphenicol resistance gene promoter from Rhodococcus 
f ascians (Desomer et al . : Molecular Microbiology (1992) 6 

15 (16), 2377-2385) to give plasmid pOTTO-1. Ultimately, a 
blunt end ligation was used for the subcloning of the 
amplified product due to the fact that ligation using the 
engineered restriction sites was unsuccessful. This fusion 
was ligated to shuttle-vector pRR-6 to produce plasmid 

20 pENOK-20. CPE648 transf ormants of pENOK-20 were grown in 
the presence of DBT and 25 yg/ml chloramphenicol for pro- 
moter induction. All transf ormants converted DBT to DBT- 
sulfone presumably through the activity of the ORF 3 as 
demonstrated in subclone pENOK-3. The inability to further 

25 process DBT-sulfone with the presence of ORF 2 suggests 

that the product of ORF 2 alone is incapable of using DBT- 
sulfone as a substrate. This is consistant with results 



WO 94/01563 



PC17US93/06497 



-53- 

obtained from pENOK- Nsi . and suggests that ORF-2 alone is 
incapable of using DBT-sulfone as a substrate. 

Assignment of Gene Products of ORFs 1, 2 and 3 

Based on the foregoing subclone analyses, functions 
5 have been tentatively assigned to each of the ORFs present 
within the pTOXI-1 sequence. ORF 3 can be identified as 
responsible for an oxidase capable of conversion of DBT to 
DBT-sulfone. Subclone pENOK-3 demonstrates this activity 
very clearly. ORFs 1 and 2 appear to be responsible for 

10 conversion of DBT-sulfone to 2-HBP. This aryl sulfatase 
activity is evidenced in subclone pENOK-1. However sub- 
clones pENOK-19 and pENOK-20 indicate that neither ORF 1 o 
ORF 2 alone is capable of any conversion of the inter- 
mediate DBT-sulfone. This suggests that the protein pro- 

15 ducts of ORFs 1 and 2 work together. to cleave both of the 
carbon-sulfur bonds. Presumably, this is achieved through 
a heterodimer arrangement of the proteins, or through a 
regulatory function of one protein on the other. The 
results of paralell investigations, presented in Example 3 

20 suggested that ORF-1 encodes an enzyme that converts DBT — 
sulfone to DBT-sultone. Lengthy incubations of CPE-64 8 
harboring pENOK-19 (intact native promoter and ORF-I) have 
shown neither the depletion of DBT-sulfone nor the pro- 
duction of any new products. This is contrary to indi- 

25 cations derived from Example 3. 
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Alternative Promoter Screening 

Increasing the specific activity of desulfurization is 
a significant objective of the studies described herein. 
One approach to accomplishing this goal is to replace the 
5 original promoter with one that can produce both higher and 
constitutive expression of the desulfurization gene clus- 
ter. Because there are so few reported and characterized 
Rhodococcus p romoters . random genomic libraries have been 
prepared and screened for promoter activity in two systems. 
10 In one, the reporter is the chloramphenicol resistance gene 
used in the above-discussed plasmid constructions. In the 
other, the desulfurization cluster itself is used as a 
reporter. 

Promoter Screening Example A. Chloramphenicol Resistance 
15 Reporter. 

As also described below, partially digested Rhodo- 
coccus genomic DNA has been cloned upstream of a promo- 
terless chloramphenicol resistance gene. The resulting 
libraries were then transformed into Rhodococcus which are 

20 subjected to chloramphenicol selection. Four apparent 
promoter elements were rescued by pRHODOPRO-2 , although 
plasmid could be isolated from only one of these, possibly 
due to vector instability. The stable plasmid RP2-2A has 
been subjected to sequence analysis. Technical problems 

25 have been observed with restriction enzyme treatment of the 
Narl cloning site used in these vectors. Unfortunately, 
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the Na£l_enzyme demonstrates severe site-selectivity and 
does not appear to digest the vector well. New vectors 
have been constructed in order to alleviate this problem, 
although a lack of convenient and unique restriction sites 
5 slowed the progress of these studies, A recent observation 
on the Rhodococcus replication origin will aid in con- 
structing a more effective promoter probe, as discussed 
below. 

Recently, the 1.4kb Ball I fragment ■ was removed from 
10 pRR-6, and the ends were blunted and religated to produce 
pRR-12 (Figure 9), which contains no Bal ll sites. Desomer 
et_al^__ (Molecular Microbiology (1992) 6 (16), 2377-2385) 
reported that this region was needed for plasmid repli- 
cation. Thus, it was surprising that this construct was 
15 capable of producing Cm r trans formants , indicating that 
this region was not essential for plasmid replication in 
the strain of organisms used for the present studies. This 
observation forms the conceptual basis for construction of 
a vector that will utilize a synthesized Ball I site for 
20 cloning the random genomic fragments. Bal l I accepts DNA 
digested by S^u3A, an effective and frequent cutter of 
IGTS8 DNA. These constructs are expected to allow for the 
production of better, more representative random genomic 
libraries. 
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Promoter Screening Example B: Desulfurization Cluster 
Reporter. 

Vector pKAMI has been used as a second direct "shot — 
gun" approach to finding a suitable alternative promoter 
5 (Figure 10) ♦ An Ed£l site was engineered upstream of the 
promoterless Dsz cluster to serve as the site of insertion 
of random genomic DNA (from strains GPE-362, CPE-648 and 
IGTS8) fractionated by Nde l and the compatible 4bp cutters 
Wsel and Bfa l . Originally, this ligation mixture was 

10 directly transformed into GPE-362 cells, which were then 

used en masse to inoculate 250 ml BSM + DBT. These efforts 
were undertaken with the goal of amplifying a superior Dsz+ 
strain due to its ability to utilize DBT as the sole source 
of sulfur. To date, 14 transformations of this type have 

15 been done. Of these, all but 2 have resulted in producing 
Dsz+ cultures. Eleven individual clones have been isolated 
and characterized. These are capable of low-level (0.6 - 
1.0 mg/L 2-HBP/OD 60a /hr) , constitutive expression of the 
desulfurization trait. Restriction analysis of plasmids 

2 0 isolated from these eleven has revealed that all but one 
(KB4-3) are simple rearrangements of the pKAMI backbone 
resulting in gratuitous expression from vector borne pro- 
moters. Many of the rescued plasmids show identical re- 
striction patterns although originating from separate 

25 ligations, suggesting an inherent vector instability. It 
appears as if, with this type of selection, rearrangements 
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of pKAMI that utilize a vector promoter sequence are stron- 
gly selected. 

The above-described selection procedure has thus given 
way to a promoter screen geared to minimize the plasmid 
5 rearrangement. In this procedure, the pKAMI/genomic li- 
brary is first amplified in E. coli . then the individual 
JM109 colonies are pooled together. The plasmids are 
extracted, and used to transform Dsz- strain GPE-362. 
Instead of using en masse enrichment,, the GPE362 transf or- 

10 mations are plated to Rich Media + chloramphenicol for 

selection of plasmid containing cells. Resulting colonies 
are replica-plated to BSM agarose + DBT plates, then 
checked for desulfurizatton activity by UV fluorescence 
production. Over 7,000 GPE-362 transf ormants have been 

15 screened in this fashion. Thirty-six have been isolated 
from these which produce UV fluorescence on BSM + DBT 
plates. Current efforts focus on the identification and 
characterization of the engineered plasmids borne by these 
3 6 transf ormants. 

2 0 Alterna tive Promotor Engineering 

The close physical arrangement of the three ORFs of 
pTOXI-1 does not provide sufficient space for promoters for 
either ORFs 2 or 3. This fact, coupled with the results of 
the subclone analysis in which intact ORFs 2 and 3 provided 

25 no activity (see pENOK-2, pENOK-li, and pENOK-13) , sug- 
gested that this cluster of genes is organized as an operon 
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with only one promoter for expression of the three genes. 
Given that the desulfurization trait of I GTS 8 is repressed 
by sulfate (Kilbane and Bielaga, Final Report D.O.E. Con- 
tract No. DE-AC22-88PC8891 (1991), it is possible that the 
5 operon promoter is tightly controlled by sulfur levels, 
with the elucidation of the molecular arrangement of the 
desulfurization cluster, alternative promoters can be 
rationally engineered to eliminate the sulfur repression, 
increase expression of the desulfurization genes and there- 
10 by increase the specific activity of the Dsz* trait. 



phenicol resistance gene promoter from Rhodococcus fascians 
(Desomer et al . : Molecular Microbiology (1992) 6 (16), 

15 2377-2385), the nitrile hydratase gene promoter from Rhodo- 
coccus rhodochrous (Kobayashi, et al . r Biochimica et Bio- 
physica Acta, 1129 (1991) 23-33), or other strong promoters 
isolated from Rhodoco ccus sp. by "shot-gun" promoter prob- 
ing. Other potential alternative promoters include those 

20 from other Gram positive organisms such as Corvnebacterium . 
Bacillus. Streptomvces . and the like. 

Promoter Engineering Example A: Expression from the chlo- 
ramphenicol resistance gene promoter from Ehodococcus 
fascians . 

25 pSBG-2 (Figure 11). The promoterless desulfurization 

cluster was isolated from pTOXI-1 as a 4.0 kb 5raI/SnaBl 



Examples of potential alternative promoters include 
other known and described promoters such as the chloram- 
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fragment and ligated to a unique blunted Af I II site of pRR- 
6. This ligation inserted the cluster downstream of the 
chloramphenicol resistance gene promoter and upstream of 
the resistance structural gene. Thus, messenger RNA (mRNA) 
5 transcription should proceed through the Dsz gene cluster 
and proceed on to the resistance gene. However, original 
selections of transf ormants on chloramphenicol did not 
yield transf ormants, suggesting poor transcriptional read- 
through. Dsz+ transf ormants harboring the plasmid were 

10 selected first through sulfur bioavailability assays and 

secondarily on chloramphenicol plates. Unlike IGTS8, pSBG- 
2 transformants are capable of converting DBT to 2-HBP in 
BSM media supplemented with 20 mM Na 2 S0 4 , which demon- 
strates the removal of sulfate repression by promoter 

15 replacement. Specific activity, of transformants was mea- 
sured between 0,9 and 1.7 mg 2-HBP/l/OD 600 /hr for a 16 hr 
culture in a rich media (RM) supplemented with 25 pg/ml 
chloramphenicol . 

pSBG-3. The Rhodococcus origin of replication was 

20 removed from pSBG-2 by elimination of the 4.0 kb Xbal 

fragment. Without the origin, transformation is obtainable 
only through integration. CPE-648 transformants with this 
plasmid were selected on RM + chloramphenicol and repli- 
ca-plated onto BSM + DBT plates. Colonies were obtained 

25 which produced 2-HBP, as detected by fluorescence after 18 
hr of incubation at 30<>C. 
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Individual expression Of each ORF 

Recently, studies have been initiated to express the 
three ORFs separately, each engineered with an alternative 
promoter. These studies are expected to elucidate the 
5 following: First, any potential rate limiting steps in the 
desulfurizaf ion process will be identified and overcome. 
Potential polarity effects of operon expression, i.e. 
poorer expression of downstream ORFs 2 and 3, may be caus- 
ing such rate limitations. Also, given the unresolved 

10 issue of the individual functions of ORFs 1 and 2, these 
studies are expected to demonstrate reconstitution of 
DBT-sulfone to 2-HBP conversion by the separate expression 
of ORFs 1 and 2. 

All ORFs were isolated through PCR amplification and 

15 subsequent subcloning. A typical Shine-Dalgarno sequence 
and a unique cloning site for alternative promoters has 
been engineered upstream of each ORF. Stop codons in all 
reading frames have been engineered downstream of each ORF 
to prevent read-through. Additionally, convenient flankng 

20 restriction sites for mobilization of the promoter/ORF 

fusions have been added to each primer. The primers used 
for amplification of each ORF are listed below. In-frame 
stop codons are marked with an aster ik (*) . Sequences 
identical to pTOXI-I template DNA are shown in bold. 
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ORF1UP: 

Xbal 



5 ' -G5MT2CTAGACMATGAGGAACAGACCATSACTCAACAACGACAAATGC-3 • 
EcoRI Ndel ; Start 

5 ORF1DOWN: 

Stop Xba^ 

3 ' -GTACIGTTCGGCGCAGCTGGGGACTAAGAT CTTAAG C-5 ' 
Stop* Stop EcoRI 

ORF2UP: 

10 Balll 

5 ' -GGAAT^AGATCTCA^TGAGGAAACAGACCATGACAAGCCGCGTCGACC-3 » 
EcoRI Ndel start 

ORF2 DOWN : 

StopBalTT 

15 3 - CGGAGTTAGCGGTGGCTAICCTTAM£T AGACTTAAGC- 5 ■ 

Stop* Stop Balll 

ORF3UP: 

Msel 



5 ' -GGAAT^TTAACj^A^AGGAAACAGACCAIGACACTGTCACCTGA- 3 
20 EcoRI Ndel Start 

ORF3DOWN: 

Msel 

3 ' -GACTCCTAGACTCCGCGACTAATT CTTAAG C-5 ' 
Stop* Stop Stop EcoRI 



25 Cycling parameters were: 1 x 96©c 2.0 min 

25 x 96oc 30 sec 
50oc 30 sec 
72oc 1.0 min 

Each ORF has been successfully amplified and subcloned 
3 0 into pUC-19 Ndel as EcoRI fragments. Alternative promoters 
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will be ligated into the unique Ndel sites, and the fusions 
will be moved to Rhodococcus-E. coli shuttle vector pRR-6 for 
expression in Rhodococcus. 

Heterologous Expression of the Dsz+ Trait 
5 In order to determine whether plasmid pTOXI-1 contained 

all of the genetic material necessary for the Dsz+ trait, 
heterologous expression of pTOXI-1 was attempted in Rhodo- 
coccus fascians . a related organism which does not metabolize 
DBT (Dsz-) and in E. coli . a non-related organism which is 
10 also Dsz-. 

A. Rhodococcus fascians (ATCC 12974), a Dsz- strain, was 
transformed with pTOXI-1. A single transformant demonstrated , 
UV fluorescence on BSM + DBT plates, and further analysis by 
HPLC clearly indicated production of 2-HBP when DBT was 

15 provided as a substrate. Thus pTOXI-1 contains sufficient 

information to convert a heterologous Dsz- strain to the Dsz+ 
phenotype. 

B. E. coli strain JM109 was also transformed with pTOXI-1 
and was incubated with each of the substrates DBT and DBT- 

20 sulfone in either a minimal media (BSM) or a rich media 

(Luria. Broth) . In no case was production of 2-HBP observed 
by HPLC analysis. The inability of E . coli to express the 
desulfurization genes was not unexpected as gram positive 
genes are not universally expressible in E. coli without 

25 promoter replacement. 

In order to replace the promoter of the desulfurization 
cluster, a 4.0 kb fisal/SnaBI fragment was isolated from 
pTOXI-1. This fragment contains all of the necessary struc- 
tural genes but lacks the promoter sequences. This pro- 

30 moterless desulfurization cluster was ligated to E. coli 

expression vector pDR540 (Pharmacia, Piscataway, NJ) cut with 
lamHI and ends made blunt with Klenow. The construction 
fuses the tac promoter to the desulfurization cluster. The 
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tac promoter is under control of the lactose repressor and is 
repressed in a lad** host such as JM109. Expression from the 
tac promoter is inducible by the addition of isopropyl S-D- 
thiogalactopyranoside (IPTG) ♦ Transformants of JM109 harbor- 
5 ing pDRDsz grown in Luria Broth at 30 *C demonstrate the Dsz+ 
phenotype when incubated with DBT and induced with IPTG. A 
specific activity as high as 1.69 mg 2HBP/l/OD 600 /hr has been 
observed with pDRDsz. Activity is greatly diminished when 
transformants are grown at 37 °C. The highest level of activ- 

10 ity has been observed at lhr post induction. 

The above-described expression of the Dsz+ trait in both 
a related and non-related heterologous host indicates that 
pTOXI-1 carries all of the genetic information required for 
conversion of DBT to 2-HBP. 

15 Successful expression in E. coli p rovided a workable 

system in which the proteins encoded by the desulfurization 
cluster could be identified and characterized. Total protein 
from Dsz+ cells of JMI09 (pDRDsz) was isolated and examined 
on denaturing acrylamide gels. No novel bands could be 

20 detected with Coomassie stain. Cellular fractionation of 
proteins into periplasmic, cytosolic and membrane components 
were also analyzed by Coomassie stained gels. Again, no 
novel bands were detected. Without any purification, the 
newly expressed proteins were apparently levels too low to 

25 easily detect and resolve from background. 

Maxicell An alysis of E. coli harboring pDRDs? 



strain of E. coli e.g. JM109 which harbors a plasmid is grown 
in M9CA medium (Maniatis et al . ) to a density of 2 x 10 8 
cells/ml. Continuously stirred cells were then subjected to 
UV exposure from a Mineralight Lamp Model ■ UVG-254 (Ultro- 
35 vilet Products, Inc., San Gabriel, CA) at a distance of 10 cm 
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for a fluence rate of 0.5 Joules- ro V. Cells were exposed 
for either 60, 90 or 120 seconds. The cells were then incu- 
bated at 37 «C for 16 hours after which they were then washed 
with M9 buffer and suspended in minimal medium lacking sul- 
5 fate. After 1 hour of starvation at 27'C, [ 35 S] methionine 
(>1000 Ci/mmol) (NEN Research Products, Boston, MA) was added 
at a final concentration of 5 jiCi/ml and incubation was 
continued for 1 hour. Cells were collected by centrifugation 
and proteins isolated through a boiled cell procedure (Mani- 
10 atis, et al .) . Proteins were separated on an acrylamide gel. 
After the run, the gel was dried and subjected to autoradio- 
graphy for 3 days. 

Maxicells of JM109 harboring vector pDR540 showed only 
vector marker galactokinase protein. Maxicells of JM109 
harboring vector pDRDsz showed the presence of three novel 
protein bands of sizes which correlated well with the pre- 
dicted molecular weights of the three proteins responsible 
for the Dsz+ trait, as predicted by open reading frame analy- 
sis (see Table 3) . 



15 



Table 3 



25 



Open Reading 
Frame 


Predicted 
Size (kDa) 


Measured 
Size (kDa) 


ORF-1 


49.5 


49.5 


ORF-2 


38.9 


33.0 


ORF-3 


45.1 


45.0 



Data obtained from Maxicell analysis thus indicated 
that the three predicted open reading frames of pTOXI-l 
encode three structural genes which constitute the desul- 
furization phenotype. 

The relative intensity of the three novel bands is 
reflective of both the number of methionine residues and 
the level of translation for each of the proteins. Clear- 
ly, ORF-2 with only 1 Met gives the faintest band. In 
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addition to the incorporation of only a single Met resi- 
' due, E . coli may process the single terminal methionine, 
further reducing the amount of labelled protein. There- 
fore, the low intensity of the ORF-2 band most likely 
5 does not strictly suggest a low level of protein trans- 
lation. 

Interestingly, the ORF furthest from the promoter 
(ORF-3) appears to be present at levels comparable to 
ORF-1, indicating no polar effects in this operon when 
10 expressed in E. coli . It is expected that more signifi- 
cant information regarding protein levels will be ob- 
tained from a similar Maxicell analysis of a Rhodococcus 
sp. host containing plasmid pTOXI-I. Additionally, the 
presence of an ORF-I/ORF-2 heterodimer, postulated above, 
15 may be observable under non-denaturing conditions. 

As required by 37 C.F.R. Section 1.821(f), Ap- 
plicants Attorney hereby states that the content of the 
"Sequence Listing" in this specification in paper form 
and the content of the computer-readable form (diskette) 
20 of the "Sequence Listing" are the same. 

EQUIVALENTS 

Those skilled in the art will recognize, or be able 
to ascertain using no more than routine experimentation, 
*any equivalents to the specific embodiments of the 
2^nvention described herein. These and all other such 
quivalents are intended to be encompassed by the follow- 
ng claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Energy BioSystems Corporation 

(B) STREET: 3608 Research Forest Drive B-7 

(C) CITY: The Woodlands 

(D) STATE: TX 

(E) COUNTRY: US 

(F) ZIP: 77381 

(G) TELEPHONE: 713-364-6100 

(H) TELEFAX: 713-364-6110 

<ii) TITLE OF INVENTION: Recombinant DNA Encoding A 
Desulfurization Biocatalyst 

(iii) NUMBER OF SEQUENCES : 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hamilton, Brook, Smith and Reynolds, P.C. 

(B) STREET: Two Militia Drive 

(C) CITY: Lexington 

(D) STATE: Massachusetts 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02173 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
{ B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Brook, David E 

(B) REGISTRATION NUMBER: 22,592 

•<C) REFERENCE /DOCKET NUMBER: EBC92-03A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-861-6240 

(B) TELEFAX: 617-861-9540 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

<B) LOCATION: 790. .2151 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3256. .4506 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GCATGCACGT CGCGCCGACG CATTTGCGCG CACGGCTCCG GGCAGTTCTC GCGGCGCTGG 60 

AGGCACGGAT GGGCACCCTC AACGAACTCA CCCAAACCAC GCCGATAGCG ATCCTCGCCG 120 

AAACCCTCGG CTACAGCCCT CAGACATTGG AAGCTCATGC GCGACGCATC CGGATCGACC 180 

TTTGCACGCT ACGTGGCGAC GCGGCTGGAC TGACGCTGGA GGTCCGACCC GACGTGTGTG 240 

GTGTAGCGCC GCTTAACGGG TGCGCACGGC GGGACATCGG CCAGCTGGCT TGCCCCTCCT 300 

CCGCAGGTAG TCGACCACCC CTTCCCGCAG CGGTCGGAGG TGATCGACCG TTAGGGTCAT 360 

TTGCTCGCAG ATCGGCTGAT GTTGCCGATC GACGTGGTCG ACGGGACACG CTCGCGATTG 420 

GCATGGCGTC CGGTGCATAC ACGACGATCT AACCAGATCG ACGGTTTTGA GCGTCGGTCA 480 

ACGTCGACTC GATGCGCCGT GCGAGTGAGA TCCTTTGTGG TGCTTGGCTA TTGACCTCGA 540 

CAAGGATAGA GATTCGAAGG ACCTCGGATC GACCCAAATG CGGACGGCCG GCAGCGGCGA 600 

AGGCGGCCAA GTCATCGGCA CCGTCACCGT CACCTTGACC CGACGTGCCC CGTGGTTCAA 660 

GGCCTGAATT TGGCTGGTGG AGCATTGAAA TCAGGTGAAG TTTAACGGTG GGCACACCCC 720 

GGGGGTGGGG GTGAGACTGC TTAGCGACAG GAATCTAGCC ATGATTGACA TTTAAAGGAC 780 

GCATACGCG ATG ACT CAA CAA CGA CAA ATG CAT CTG GCC GGT TTC TTC 828 
Met Thr Gin Gin Arg Gin Met His Leu Ala Gly Phe Phe 
15 10 

TCG GCC GGC AAT GTG ACT CAT GCA CAT GGG GCG TGG CGG CAC ACG GAC 876 
Ser Ala Gly Asn Val Thr His Ala His Gly Ala Trp Arg His Thr Asp 
15 20 25 

GCG TCG AAT GAC TTT CTG TCG GGG AAG TAC TAC CAA CAC ATC tJCC CGT 924 
Ala Ser Asn Asp Phe Leu Ser Gly Lys Tyr Tyr Gin His lie Ala Arg 
30 35 40 45 

ACT CTG GAG CGC GGC AAG TTC GAT CTG TTG TTT CTG CCT GAC GGG TTG 972 
Thr Leu Glu Arg Gly Lys Phe Asp Leu Leu Phe Leu Pro Asp Gly Leu 
50 55 60 

GCC GTC GAG GAC AGC TAC GGG GAC AAC CTG GAC ACC GGT GTC GGC CTG 1020 
Ala Val Glu Asp Ser Tyr Gly Asp Asn Leu Asp Thr Gly Val Gly Leu 
65 70 75 

GGC GGG CAG GGT GCA GTC GCC TTG GAG CCG GCC AGT GTG GTC GCA ACC 1068 
Gly Gly Gin Gly Ala Val Ala Leu Glu Pro Ala Ser Val Val Ala Thr 
80 85 90 

ATG GCC GCG GTG ACC GAG CAC CTG GGT CTT GGG GCA ACC ATT TCG GCG 1116 
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„« Al. «. ,.1 »r «. «U « .IT - «, AJa IU •« Ma 

ACC T« TAT CCC CC= TAT CAC CTT OCT CGC CTG «e «J »g « .«« 

Thr Tyr Tyr Pro Pro Tyr His Val Ala Arg ^ 125 
110 115 

CAG TTG TCA GCG GOT COG GTG TCC TCC AAC CTC CTC ACC TCG CTC AAC 
Gin Leu Ser Gly Gly Arg Val Ser Trp Asn vai 14Q 

130 135 
CAC OCT CAA OCC COC AAC TTC CCC ATT AAT CAC CAT CTG CAA CAC CAC 
Asp Ala Glu Ala Arg Asn Phe Gly lie ash v, ^ 
145 lsu 

TTC TTG GAA GCG GTC AAG AAA CTC 

CCC CGC TAT GAC CGC CCC GAT GAG TTC TTG GAA ^ Lyg ^ 

Ala Arg Tyr Asp Arg Ala Asp Glu Phe mu ^ q 

160 16,5 
TOO AAC ACC «. CAC CAC CAC OCC CTC CTC CTO CAC AAC CCG OCC COC 
Trp Asn Ser Trp Asp Glu Asp Aia i-«u ^ 

GTG « GCC GAT CCC GCG AAG GTG CAC TAC GTC GAT CAC CAC «0 «0 

Val Phe Ala Asp Pro Ala Lys Val His Tyr vax p 2Q5 
190 195 

5 = 2 2 |i S 2 2 S S 2 2 = S i " 

2 2 515 s = 2 s s i s s 2 s s s s 

OCC COC AAG TGG GCC OAO OCC CTC TTC ACT CTT CCA CCC AAC CTC GAG 

Ala Gly Lys Trp Ala Glu Ala Val Phe se ^ 
240 243 

2 s s s s 2 a s s E E s s 2 s s 

255 260 

~ itc TTC ACC GCC GTG ATG CCG GTA 

GGG CGC GAT CCC GAT CAG ACG AAA ATC TTC ACC G ^ ^ ^ 

Gly Arg Asp Pro Asp Gin Thr Lys lie Phe Tnr 2Q5 
270 275 

25 SS 2 2 a S S3 S 2 2 2 2 Si - 2 2 
S = 2 2 2 2 2 2 2 2 S 2 = = 2 2 

305 JiU 
CGC ATC AAC CTG GCG GCG TAC OCT CTC GAC ACT « ATC AAG GAC ATC 

Gly He Asn Leu Ala Ala Tyr Pro Leu asp 33Q 

320 3 " 
CTG COG OAT CTO CAO OAT COG AAT GTC CCG ACC CAA CTO CAC ATG TTC 

Leu Arg Asp Leu Gin Asp Arg Asn Val Pro Thr ^ 



1164 



1212 



1260 



1308 



1356 



1404 



1452 



1500 



1548 



1596 



1644 



1692 



1740 



1788 



1836 



335 340 
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1884 



^ trr CTG GCG GAA ATG GGT CGG 

S S S S SS S SK !K S S 2- - - - 53 
S S S S K « « = S I S 5 S S » S 

s- s s s s « = s I ss = = « I s s 
s ss s S s s ss s a = ss s £ - = 85 

400 ~m CAG GAT CGC GGC TAC TTC CGC 2076 

S S SS SS SS !S S S3 S SS =5 5, - * - - 

~ - - ss s; ss s ss ss ss ss s: ss s: ss a 2124 . 

Thr Glu Tyr Gin Cly JJJ in 440 

» « e» ccr tca tg*=«gc=g oBC-eceo gc«accccg »i« 

rm CAA CTG CAA GGA CAA cui ^ 
Pro c£n Leu Gin oly Gin Pro ser 

■—.===== " 

TT0 GGG=TG* GATCCCGCCA CTGCTCAG mcsccccg;i 
onrrcCGCTC TTGGGGCGCC AGGGCTTCXl 
" ^» ~ CTCGGCCTCG OC^C, 

TCHCMCOOC MCCGMCH MTtoe »TCC CTGGCGGCAA ACGCTGGTAG 

— — ™zz ~ ~ — « — — 

« ~ GTCCTGGTGT CC^=C «— 
1 n^TC .CCTCTTTCC CGXTGXCCGC CGCGGTCAGO 

CCGCGGTGTT GGCCAO- =™ 
-»»- CRCCGGGGCC =G=CC. 

~ rrr= " — - «— —~ 

GACTGGTCGA CGCGGCCG 

rrr: — — - — 



2298 
2358 
2418 
2478 
2538 
2598. 
2658 
2718 
2778 
2838 
2898 
2958 
3018 
3078 
3138 
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GCACACAGCA ATTCCTGCTC ACCAACAACT TGCTGCAGGA ACCCGTCGCC CTCGATCAGT 3198 

GGGCGGCTCC GGAATTTCTG AACAACAGCC TCAATCGCCA CCGATAGGAA CATCCGC 3255 

ATG ACA CTG TCA CCT GAA AAG CAG CAC GTT CGA CCA CGC GAC GCC GCC 3303 
Met Thr Leu Ser Pro Glu Lys Gin His Val Arg Pro Arg Asp Ala Ala 
15 10 15 

GAC AAC GAT CCC GTC GCG GTT GCC CGT GGG CTA GCC GAA AAG TGG CGA 3351 
Asp Asn Asp Pro Val Ala Val Ala Arg Gly Leu Ala Glu Lys Trp Arg 
20 25 30 

GCC ACC GCC GTC GAG CGT GAT CGC GCC GGG GGT TCG GCA ACA GCC GAG 3399 
Ala Thr Ala Val Glu Arg Asp Arg Ala Gly Gly Ser Ala Thr Ala Glu 
35 40 45 

CGC GAA GAC CTG CGC GCG AGC GCG CTG CTG TCG CTC CTC GTC CCG CGC 3447 
Arg Glu Asp Leu Arg Ala Ser Ala Leu Leu Ser Leu Leu Val Pro Arg 
50 55 60 

GAA TAC GGC GGC TGG GGC GCA GAC TGG CCC ACC GCC ATC GAG GTC GTC 3495 
Glu Tyr Gly Gly Trp Gly Ala Asp Trp Pro Thr Ala lie Glu Val Val 
65 70 75 80 

CGC GAA ATC GCG GCA GCC GAT GGA TCT TTG GGA CAC CTG TTC GGA TAC 3543 
Arg Glu He Ala Ala Ala Asp Gly Ser Leu Gly His Leu Phe Gly Tyr 
85 90 95 

CAC CTC ACC AAC GCC CCG ATG ATC GAA CTG ATC GGC TCG CAG GAA CAA 3591 
His Leu Thr Asn Ala Pro Met He Glu Leu He Gly Ser Gin Glu Gin 
100 105 HO 

GAA GAA CAC CTG TAC ACC CAG ATC GCG CAG AAC AAC TGG TGG ACC GGA 3639 
Glu Glu His Leu Tyr Thr Gin He Ala Gin Asn Asn Trp Trp Thr Gly 
115 120 125 

AAT GCC TCC AGC GAG AAC AAC AGC CAC GTG CTG GAC TGG AAG GTC AGC 3687 
Asn Ala Ser Ser Glu Asn Asn Ser His Val Leu Asp Trp Lys Val Ser 
130 135 140 

GCC ACC CCG ACC GAA GAC GGC GGC TAC GTG CTC AAT GGC ACG AAG CAC 3735 
Ala Thr Pro Thr Glu Asp Gly Gly Tyr Val Leu Asn Gly Thr Lys His 
145 150 155 160 

TTC TGC AGC GGC GCC AAG GGG TCG GAC CTG CTG TTC GTG TTC' GGC GTC 3783 
Phe Cys Ser Gly Ala Lys Gly Ser Asp Leu Leu Phe Val Phe Gly Val 
165 170 175 

GTC CAG GAT GAT TCT CCG CAG CAG GGT GCG ATC ATT GCT GCC GCT ATC 3831 
Val Gin Asp Asp Ser Pro Gin Gin Gly Ala He He Ala Ala Ala He 
180 185 190 

CCG ACA TCG CGG GCT GGC GTT ACG CCC AAC GAC GAC TGG GCC GCC ATC 3879 
Pro Thr Ser Arg Ala Gly Val Thr Pro Asn Asp Asp Trp Ala Ala He 
195 200 205 

GGC ATG CGG CAG ACC GAC AGC GGT TCC ACG GAC TTC CAC AAC GTC AAG 3927 
Gly Met Arg Gin Thr Asp Ser Gly Ser Thr Asp Phe His Asn Val Lys 
210 215 220 

GTC GAG CCT GAC GAA GTG CTG GGC GCG CCC AAC GCC TTC GTT CTC GCC 3975 
Val Glu Pro Asp Glu Val Leu Gly Ala Pro Asn Ala Phe Val Leu Ala 
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225 230 235 240 

TTC ATA CAA TCC GAG CGC GGC AGC CTC TTC GCG CCC ATA GCG CAA TTG 4023 
Phe He Gin Ser Glu Arg Gly Ser Leu Phe Ala Pro He Ala Gin Leu 
245 250 255 

ATC TTC GCC AAC GTC TAT CTG GGG ATC GCG CAC GGC GCA CTC GAT GCC 4071 
He Phe Ala Asn Val Tyr Leu Gly He Ala His Gly Ala Leu Asp Ala 
260 265 270 

GCC AGG GAG TAC ACC CGT ACC CAG GCG AGG CCC TGG ACA CCG GCC GGT 4119 
Ala Arg Glu Tyr Thr Arg Thr Gin Ala Arg Pro Trp Thr Pro Ala Gly 
275 280 285 

ATT CAA CAG GCA ACC GAG GAT CCC TAC ACC ATC CGC TCC TAC GGT GAG 4167 
He Gin Gin Ala Thr Glu Asp Pro Tyr Thr He Arg Ser Tyr Gly Glu 
290 295 300 

TTC ACC ATC GCA TTG CAG GGA GCT GAC GCC GCC GCC CGT GAA GCG GCC 4215 
Phe Thr He Ala Leu Gin Gly Ala Asp Ala Ala Ala Arg Glu Ala Ala 
305 310 315 320 

CAC CTG CTG CAG ACG GTG TGG GAC AAG GGC GAC GCG CTC ACC CCC GAG 4263. 
His Leu Leu Gin Thr Val Trp Asp Lys Gly Asp Ala Leu Thr Pro Glu 
325 330 335 

GAC CGC GGC GAA CTG ATG GTG AAG GTC TCG GGA GTC AAA GCG TTG GCC 4311 
Asp Arg Gly Glu Leu Met Val Lys Val Ser Gly Val Lys Ala Leu Ala 
340 345 350 

ACC AAC GCC GCC CTC AAC ATC AGC AGC GGC GTC TTC GAG GTG ATC GGC 4359 
Thr Asn Ala Ala Leu Asn He Ser Ser Gly Val Phe Glu Val He Gly 
355 360 365 

GCG CGC GGA ACA CAT CCC AGG TAC GGT TTC GAC CGC TTC TGG CGC AAC 4407 
Ala Arg Gly Thr His Pro Arg Tyr Gly Phe Asp Arg Phe Trp Arg Asn 
370 375 380 

GTG CGC ACC CAC TCC CTG CAC GAC CCG GTG TCC TAC AAG ATC GCC GAC 4455 
Val Arg Thr His Ser Leu His Asp Pro Val Ser Tyr Lys He Ala Asp 
385 390 395 400 

GTC GGC AAG CAC ACC TTG AAC GGT CAA TAC CCG ATT CCC GGC TTC ACC 4503 
Val Gly Lys His Thr Leu Asn Gly Gin Tyr Pro He Pro Gly Phe Thr 
405 410 415 

TCC TGAGGATCTG AG GCG CTG AT CG AGG CCG AG GCCACCGCGC GGCCGAGTCG 4556 
Ser 



CGAATCGCCC 


GCCGATACTC 


AGCTTCTCCA 


TACGTACGGG 


TGCACACAAG 


GAGATATTGT 


4616 


CAAGACCTGT 


GGATGAGGGT 


GTTTCAGGCG 


ACCTCCGTTT 


CGCTTGATTC 


GTCGGGCTCA 


4676 


GCGGGTGAGA 


TGTCGATGGG 


TCGTTCGAGC 


AGCTTGCCTT 


TGTGGAACAC 


CGCGCCGGCA 


4736 


CGGACCAGCG 


CGACCAGATG 


GGGGGCGTTG 


ACCGCCGCCA 


GCGGG CTTGT 


GCGGCGTCGA 


4796 


TCAGCTTGTA 


GGCCATGGCA 


ATCCCGCTGC 


GACGTGACCC 


AGGGCCCTTG 


GTGACCTTGG 


4856 



TTCGCAACCG CACGGTCGCA 



AACGTCGATT 



CGATCGGATT CGTAGTGCGC AAGTGGATCC 4916 
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AGTGCTCGGC 


CGGGTACCGG 


TAGAACTCCA 


GGAGCACGTC GGCGTCGTCG 


ACGATCTTGG 


4976 


CGACCGCCTT 


GGGGTACTTC 


GCGCCGTAAT 


CTACCtCGAA GGCCTTGATC GCGACCTGGG 


5036 


CCTTGTCGAT 


GTCCTCGGCG 


TTGTAGATTT 


CCCGCATCGC 


CGCGGTCGCA 


CCTGGATGAG 


5096 


CCGACTTGGG 


CAGCGCAGCA 


AGCACATTGG 


CCTGCTTGTG 


AAACCAGCAG 


CGCTGTTCAC 


5156 


GGGTATCCGG 


AAACACCTCC 


CGCAGTGCCT 


TCCAGAACCC 


CAGCGCCCCA 


TCACCGACGG 


5216 


CCAGCACCGG 


GGCGGTCATC 


CCGCGGCGTC 


GGCATGAGCG 


CAGCAGATCA 


GCCCACGACT 


5276 


CTGTGGACTC 


CCGGAACCCA 


TCGGTGAGCG 


CGACGAGCTC 


CTTGCGGCCG 


TCGGCGCGGA 


5336 


CGCCGATCAT 


CACGAGCAAG 


CACAGCTTCT 


CCTGCTCCAG 


GCGGACATTG 


AGATGGATGC 


5396 


CGTCGACCCA 


TAGGTACACG 


AAATCGGTGC 


CCGAGAGATC 


CCGGTCGGCG 


AAGGCCTTCG 


5456 


CCTCGTCCTG 


CCATTGCGCG 


GTCAGCCGGG 


TGATCGTCGA 


GGCCGACAGC 


CCGGCACCAG 


5516 


TGCCGAGGAA 


CTGCTCCAA 










5535 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Gin Gin Arg Gin Met His Leu Ala Gly 
15 10 

Asn Val Thr His Ala His Gly Ala Trp Arg His 
20 25 

Asp Phe Leu Ser Gly Lys Tyr Tyr Gin His lie 
35 40 

Arg Gly Lys Phe Asp Leu Leu Phe Leu Pro Asp 
50 55 

Asp Ser Tyr Gly Asp Asn Leu Asp Thr Gly Val 
65 70 75 

Gly Ala Val Ala Leu Glu Pro Ala Ser Val Val 
85 90 

Val Thr Glu His Leu Gly Leu Gly Ala Thr He 
100 105 

Pro Pro Tyr His Val Ala Arg Val Phe Ala Thr 
115 120 

Gly Gly Arg Val Ser Trp Asn Val Val Thr Ser 
130 135 

Ala Arg Asn Phe Gly He Asn Gin His Leu Glu 



Phe Phe Ser Ala Gly 
15 

Thr Asp Ala Ser Asn 
30 



Ala Arg Thr Leu Glu 
45 



Gly Leu Ala Val Glu 
60 

Gly Leu Gly Gly Gin 
80 

Ala Thr Met Ala Ala 
95 



Ser Ala Thr Tyr Tyr 
110 

Leu Asp Gin Leu Ser 
125 

Leu Asn Asp Ala Glu 
140 

His Asp Ala Arg Tyr 
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145 



150 



155 



160 



Asp Arg Ala Asp Glu Phe Leu Glu Ala Val Lys Lys Leu Trp Asn Ser 
165 170 175 

Trp Asp Glu Asp Ala Leu Val Leu Asp Lys Ala Ala Gly Val Phe Ala 
180 185 190 

Asp Pro Ala Lys Val His Tyr Val Asp His His Gly Glu Trp Leu Asn 
195 200 205 

Val Arg Gly Pro Leu Gin Val Pro Arg Ser Pro Gin Gly Glu Pro Val 
210 215 220 

lie Leu Gin Ala Gly Leu Ser Pro Arg Gly Arg Arg Phe Ala Gly Lys 
225 230 235 240 

Trp Ala Glu Ala Val Phe Ser Leu Ala Pro Asn Leu Glu Val Met Gin 
245 250 255 

Ala Thr Tyr Gin Gly lie Lys Ala Glu Val Asp Ala Ala Gly Arg Asp 
260 265 270 

Pro Asp Gin Thr Lys lie Phe Thr Ala Val Met Pro Val Leu Gly Glu 
275 280 285 

Ser Gin Ala Val Ala Gin Glu Arg Leu Glu Tyr Leu Asn Ser Leu Val 
290 295 300 

His Pro Glu Val Gly Leu Ser Thr Leu Ser Ser His Thr Gly lie Asn 
305 310 315 320 

Leu Ala Ala Tyr Pro Leu Asp Thr Pro lie Lys Asp He Leu Arg Asp 
325 330 335 

Leu Gin Asp Arg Asn Val Pro Thr Gin Leu His Met Phe Ala Ala Ala 
340 345 350 

Thr His Ser Glu Glu Leu Thr Leu Ala Glu Met Gly Arg Arg Tyr Gly 
355 360 365 

Thr Asn Val Gly Phe Val Pro Gin Trp Ala Gly Thr Gly Glu Gin He 
370 375 380 

Ala Asp Glu Leu He Arg His Phe Glu Gly Gly Ala Ala Asp Gly Phe 
385 390 395 400 

He He Ser Pro Ala Phe Leu Pro Gly Ser Tyr Asp Glu Phe Val Asp 
405 410 415 

Gin Val Val Pro Val Leu Gin Asp Arg Gly Tyr Phe Arg Thr Glu Tyr 
420 425 430 

Gin Gly Asn Thr Leu Arg Asp His Leu Gly Leu Arg Val Pro Gin Leu 
435 440 445 



Gin Gly Gin Pro Ser 
450 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 416 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Thr Leu Ser Pro Glu Lys Gin His Val Arg Pro Arg Asp Ala Ala 
15 10 15 

Aso Asn Asp Pro Val Ala Val Ala Arg Gly Leu Ala Glu Lys Trp Arg 
20 25 30 

Ala Thr Ala Val Glu Arg Asp Arg Ala Gly Gly Ser Ala Thr Ala Glu 
35 40 45 

Arg Glu Asp Leu Arg Ala Ser Ala Leu Leu Ser Leu Leu Val Pro Arg 
50 55 60 

Glu Tyr Gly Gly Trp Gly Ala Asp Trp Pro Thr Ala lie Glu Val Val 
65 70 75 80 

Arg Glu lie Ala Ala Ala Aso Gly Ser Leu Gly His Leu Phe Gly Tyr 
85 90 95 

His Leu Thr Asn Ala Pro Met He Glu Leu He Gly Ser Gin Glu Gin 
100 105 110 

Glu Glu His Leu Tyr Thr Gin He Ala Gin Asn Asn Trp Trp Thr Gly 
115 120 125 

Asn Ala Ser Ser Glu Asn Asn Ser His Val Leu Asp Trp Lys Val Ser 
130 135 140 

Ala Thr Pro Thr Glu Asp Gly Gly Tyr Val Leu Asn Gly Thr Lys His 
145 150 155 160 

Phe Cys Ser Gly Ala Lys Gly Ser Asp Leu Leu Phe Val Phe Gly Val 
165 170 175 

Val Gin Asp Asp Ser Pro Gin Gin Gly Ala He lie Ala Ala Ala He 
180 185 190 

Pro Thr Ser Arg Ala Gly Val Thr Pro Asn Asp Asp Trp Ala Ala He 
195 200 205 

Gly Met Arg Gin Thr Asp Ser Gly Ser Thr Asp Phe His Asn Val Lys 
210 215 220 

Val Glu Pro Asp Glu Val Leu Gly Ala Pro Asn Ala Phe Val Leu Ala 
225 230 235 240 

Phe lie Gin Ser Glu Arg Gly Ser Leu Phe Ala Pro lie Ala Gin Leu 
245 250 255 

lie Phe Ala Asn Val Tyr Leu Gly lie Ala His Gly Ala Leu Asp Ala 
260 265 270 
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Ala Arg Glu Tyr Thr Arg Thr Gin Ala Arg Pro Trp Thr Pro Ala Gly 
275 280 285 

lie Gin Gin Ala Thr Glu Asp Pro Tyr Thr lie Arg Ser Tyr Gly Glu 
290 295 300 

Phe Thr He Ala Leu Gin Gly Ala Asp Ala Ala Ala Arg Glu Ala Ala 
305 310 315 320 

His Leu Leu Gin Thr Val Trp Asp Lys Gly Asp Ala Leu Thr Pro Glu 
325 330 335 

Asp Arg Gly Glu Leu Met Val Lys Val Ser Gly Val Lys Ala Leu Ala 
340 345 350 

Thr Asn Ala Ala Leu Asn He Ser Ser Gly Val Phe Glu Val He Gly 
355 360 365 

Ala Arg Gly Thr His Pro Arg Tyr Gly Phe Asp Arg Phe Trp Arg Asn 
370 375 380 

Val Arg Thr His Ser Leu His Asp Pro Val Ser Tyr Lys He Ala Asp 
385 390 395 400 

Val Gly Lys His Thr Leu Asn Gly Gin Tyr Pro He Pro Gly Phe Thr 
405 410 415 

Ser 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A.) NAME /KEY: CDS 

(B) LOCATION: 2148.. 3245 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCATGCACGT CGCGCCGACG CATTTGCGCG CACGGCTCCG GGCAGTTCTC GCGGCGCTGG 60 

AGGCACGGAT GGGCACCCTC AACGAACTCA CCCAAACCAC GCCGATAGCG ATCCTCGCCG 120 

AAACCCTCGG CTACAGCCCT CAGACATTGG AAGCTCATGC GCGACGCATC CGGATCGACC 180 

TTTGCACGCT ACGTGGCGAC GCGGCTGGAC TGACGCTGGA GGTCCGACCC GACGTGTGTG 240 

GTGTAGCGCC GCTTAACGGG TGCGCACGGC GGGACATCGG CCAGCTGGCT TGCCCCTCCT 300 

CCGCAGGTAG TCGACCACCC CTTCCCGCAG CGGTCGGAGG TGATCGACCG TTAGGGTCAT 360 

TTGCTCGCAG ATCGGCTGAT GTTGCCGATC GACGTGGTCG ACGGGACACG CTCGCGATTG 420 
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GCATGGCGTC 


CGGTGCATAC 


ACGACGATCT 


AACCAGATCG 


ACGGTTTTGA 


GCGTCGGTCA 


480 


ACGTCGACTC 


GATGCGCCGT 


GCGAGTGAGA 


TCCTTTGTGG 


TGCTTGGCTA 


TTGACCTCGA 


S40 


CAAGGATAGA 


GATTCGAAGG 


ACCTCGGATC 


GACCCAAATG 


CGGACGGCCG 


GCAGCGGCGA 


600 


AGGCGGCCAA 


GTCATCGGCA 


CCGTCACCGT 


CACCTTGACC 


CGACGTGCCC 


CGTGGTTCAA 


660 


GGCCTGAATT 


TGGCTGGTGG 


AGCATTGAAA 


TCAGGTGAAG 


TTTAACGGTG 


GGCACACCCC 


720 


GGGGGTGGGG 


GTGAGACTGC 


TTAGCGACAG 


GAATCTAGCC 


ATGATTGACA 


TTTAAAGGAC 


780 


GCATACGCGA 


TGACTCAACA 


ACGACAAATG 


CATCTGGCCG 


GTTTCTTCTC 


GGCCGGCAAT 


840 


GTGACTCATG 


CACATGGGGC 


GTGGCGGCAC 


ACGGACGCGT 


CGAATGACTT 


TCTGTCGGGG 


900 


AAGTACTACC 


AACACATCGC 


CCGTACTCTG 


GAGCGCGGCA 


AGTTCGATCT 


GTTGTTTCTG 


960 


CCTGACGGGT 


TGGCCGTCGA 


GGACAGCTAC 


GGGGACAACC 


TGGACACCGG 


TGTCGGCCTG 


1020 


GGCGGGCAGG 


GTGCAGTCGC 


CTTGGAGCCG 


GCCAGTGTGG 


TCGCAA.CCAT 


GGCCGCGGTG 


1080 


ACCGAGCACC 


TGGGTCTTGG 


GGCAACCATT 


TCGGCGACCT 


ACTATCCCCC 


GTATCACGTT 


1140 


GCTCGGGTGT 


TCGCGACGCT 


CGATCAGTTG 


TCAGGGGGTC 


GGGTGTCCTG 


GAACGTCGTC 


1200 


ACCTCGCTCA 


ACGACGCTGA 


AGCGCGCAAC 


TTCGGCATTA 


ATCAGCATCT 


GGAACACGAC 


1260 


GCCCGCTATG 


ACCGCGCCGA 


TGAGTTCTTG 


GAAGCGGTCA 


AGAAACTCTG 


GAACAGCTGG • 


1320 


GACGAGGACG 


CCCTCGTGCT 


GGACAAGGCG 


GCCGGCGTGT 


TCGCCGATCC 


CGCGAAGGTG 


1380 


CACTACGTCG 


ATCACCACGG 


GGAGTGGCTG 


AATGTGCGCG 


GACCTCTGCA 


GGTACCGCGT 


1440 


TCACCTCAGG 


GTGAGCCGGT 


GATCCTGCAG 


GCCGGCCTGT 


CGCCCCGGGG 


TCGGCGCTTC 


1500 


GCCGGGAAGT 


GGGCCGAGGC 


CGTCTTCAGT 


CTTGCACCCA 


ACCTCGAGGT 


GATGCAGGCC 


1560 


ACCTACCAGG 


GCATCAAAGC 


CGAGGTCGAC 


GCTGCGGGGC 


GCGATCCCGA 


TCAGACGAAA 


1620 


ATCTTCACCG 


CCGTGATGCC 


GGTACTCGGC 


GAAAGCCAGG 


CGGTGGCACA 


GGAACGACTG 


1680 


GAATATCTCA 


ACAGTCTGGT 


CCATCCGGAA 


GTGGGACTGT 


CGACGCTATC 


CAGTCACACC 


1740 


GGCATCAACC 


TGGCGGCGTA 


CCCTCTCGAC 


ACTCCGATCA 


AGGACATCCT 


GCGGGATCTG 


1800 


CAGGA 1 CtjU A 




GCAACTGCAC 


ATGTTCGCCG 


CCGCAACGCA 


CAGCGAAGAG 


1860 


CTCACGCTGG 


CGGAAATGGG 


TCGGCGCTAT 


GGAACCAACG 


TGGGGTTCGT 


TCCTCAGTGG 


1920 


GCCGGTACCG 


GGGAGCAGAT 


CGCTGACGAG 


CTGATCCGCC 


ACTTCGAGGG 


CGGCGCCGCG 


1980 


GATGGTTTCA 


TCATCTCTCC 


GGCCTTCCTG 


CCGGGCTCCT 


ACGACGAGTT 


CGTCGACCAG 


2040 


GTGGTTCCGG 


TTCTGCAGGA 


TCGCGGCTAC 


TTCCGCACCG 


AGTACCAGGG 


CAACACTCTG 


2100 


CGCGACCACT 


TGGGTCTGCG 


CGTACCACAA 


CTGCAAGGAC 


AACCTTC ATG ACA AGC 
Met Thr Ser 
1 


2156 



CGC GTC GAC CCC GCA AAC CCC GGT TCA GAA CTC GAT TCC GCC ATC CGC 2204 
Arg Val Asp Pro Ala Asn Pro Gly Ser Glu Leu Asp Ser Ala He Arg 
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5 10 15 

GAC ACA CTG ACC TAC AGC AAC TGC CCG GTA CCC AAC GOT CTG CTC ACG 2252 
Asp Thr Leu Thr Tyr Ser Asn Cys Pro Val Pro Asn Ala Leu Leu Thr 
20 25 30 35 

GCA TCG GAA TCG GGC TTC CTC GAC GCC GCC GGC ATC GAA CTC GAC GTC 2300 
Ala Ser Glu Ser Gly Phe Leu Asp Ala Ala Gly lie Glu Leu Asp Val 
40 45 50 

CTC AGC GGC CAG CAG GGC ACG GTT CAT TTC ACC TAC GAC CAG CCT GCC 2348 
Leu Ser Gly Gin Gin Gly Thr Val His Phe Thr Tyr Asp Gin Pro Ala 
55 60 65 

TAC ACC CGT TTT GGG GGT GAG ATC CCG CCA CTG CTC AGC GAG GGG TTG 2396 
Tyr Thr Arg Phe Gly Gly Glu lie Pro Pro Leu Leu Ser Glu Gly Leu 
70 75 80 

CGG GCA CCT GGG CGC ACG CGT CTA CTC GGC ATC ACC CCG CTC TTG GGG 2444 
Arg Ala Pro Gly Arg Thr Arg Leu Leu Gly lie Thr Pro Leu Leu Gly 
85 90 95 

CGC CAG GGC TTC TTT GTC CGC GAC GAC AGC CCG ATC ACA GCG GCC GCC 2492 
Arg Gin Gly Phe Phe Val Arg Asp Asp Ser Pro lie Thr Ala Ala Ala 
100 105 110 115 

GAC CTT GCC GGA CGT CGA ATC GGC GTC TCG GCC TCG GCA ATT CGC ATC 2540 
Asp Leu Ala Gly Arg Arg He Gly Val Ser Ala Ser Ala He Arg He 
120 125 130 

CTG CGC GGC CAG CTG GGC GAC TAC CTC GAG TTG GAT CCC TGG CGG CAA 2588 
Leu Arg Gly Gin Leu Gly Asp Tyr Leu Glu Leu Asp Pro Trp Arg Gin 
135 140 145 

ACG CTG GTA GCG CTG GGC TCG TGG GAG GCG CGC GCC TTG TTG CAC ACC . 2636 
Thr Leu Val Ala Leu Gly Ser Trp Glu Ala Arg Ala Leu Leu His Thr 
150 155 160 

CTT GAG CAC GGT GAA CTG GGT GTG GAC GAC GTC GAG CTG GTG CCG ATC 2684 
Leu Glu His Gly Glu Leu Gly Val Asp Asp Val Glu Leu Val Pro He 
165 170 175 

AGC AGT CCT GGT GTC GAT GTT CCC GCT GAG CAG CTC GAA GAA TCG GCG 2732 
Ser Ser Pro Gly Val Asp Val Pro Ala Glu Gin Leu Glu Glu Ser Ala 
180 185 190 ^ 195 

ACC GTC AAG GGT GCG GAC CTC TTT CCC GAT GTC GCC CGC GGT CAG GCC 2780 
Thr Val Lys Gly Ala Asp Leu Phe Pro Asp Val Ala Arg Gly Gin Ala 
200 205 210 

GCG GTG TTG GCC AGC GGA GAC GTT GAC GCC CTG TAC AGT TGG CTG CCC 2828 
Ala Val Leu Ala Ser Gly Asp Val Asp Ala Leu Tyr Ser Trp Leu Pro 
215 220 225 

TGG GCC GGG GAG TTG CAA GCC ACC GGG GCC CGC CCA GTG GTG GAT CTC 2876 
Trp Ala Gly Glu Leu Gin Ala Thr Gly Ala Arg Pro Val Val Asp Leu 
230 235 240 

GGC CTC GAT GAG CGC AAT GCC TAC GCC AGT GTG TGG ACG GTC AGC AGC 2924 
Gly Leu Asp Glu Arg Asn Ala Tyr Ala Ser Val Trp Thr Val Ser Ser 
245 250 255 
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GGG CTG GTT CGC CAG CGA CCT GGC CTT GTT CAA CGA CTG GTC GAC GCG 2972 
Gly Leu Val Arg Gin Arg Pro Gly Leu Val Gin Arg Leu Val Asp Ala 
260 265 270 275 

GCC GTC GAC GCC GGG CTG TGG GCA CGC GAT CAT TCC GAC GCG GTG ACC 3020 
Ala Val Asp Ala Gly Leu Trp Ala Arg Asp His Ser Asp Ala Val Thr 
280 285 290 

AGC CTG CAC GCC GCG AAC CTG GGC GTA TCG ACC CGA GCA GTA GGC CAG 3068 
Ser Leu His Ala Ala Asn Leu Gly Val Ser Thr Gly Ala Val Gly Gin 
295 300 305 

GGC TTC GGC GCC GAC TTC CAG CAG CGT CTG GTT CCA CGC CTG GAT CAC 3116 
Gly Phe Gly Ala Asp Phe Gin Gin Arg Leu Val Pro Arg Leu Asp His 
310 315 320 

GAC GCC CTC GCC CTC CTG GAG CGC ACA CAG CAA TTC CTG CTC ACC AAC 3164 
Asp Ala Leu Ala Leu Leu Glu Arg Thr Gin Gin Phe Leu Leu Thr Asn 
325 330 335 

AAC TTG CTG CAG GAA CCC GTC GCC CTC GAT CAG TGG GCG GCT CCG GAA 3212 
Asn Leu Leu Gin Glu Pro Val Ala Leu Asp Gin Trp Ala Ala Pro Glu 
340 345 350 355 

TTT CTG AAC AAC AGC CTC AAT CGC CAC CGA TAGGAACATC CGCATGACAC 3262 
Phe Leu Asn Asn Ser Leu Asn Arg His Arg 
360 365 



TGTCACCTGA 


AAAGCAGCAC 


GTTCGACCAC 


GCGACGCCGC 


CGACAACGAT 


CCCGTCGCGG 


3322 


TTGCCCGTGG 


GCTAGCCGAA 


AAGTGGCGAG 


CCACCGCCGT 


CGAGCGTGAT 


CGCGCCGGGG 


3362 


GTTCGGCAAC 


AG CCG AG CGC 


GAAGACCTGC 


GCGCGAGCGC 


GCTGCTGTCG 


CTCCTCGTCC 


3442 


CGCGCGAATA 


CGGCGGCTGG 


GGCGCAGACT 


GGCCCACCGC 


CATCGAGGTC 


GTCCGCGAAA 


3502 


TCGCGGCAGC 


CGATGGATCT 


TTGGGACACC 


TGTTCGGATA 


CCACCTCACC 


AACGCCCCGA 


3562 


TGATCGAACT 


GATCGGCTCG 


CAGGAACAAG 


AAGAACACCT 


GTACACCCAG 


ATCGCGCAGA 


3622 


ACAACTGGTG 


GACCGGAAAT 


GCCTCCAGCG 


AGAACAACAG 


CCACGTGCTG 


GACTGGAAGG 


3682 


TCAGCGCCAC 


CCCGACCGAA 


GACGGCGGCT 


ACGTGCTCAA 


TGGCACGAAG 


CACTTCTGCA 


3742 


GCGGCGCCAA 


GGGGTCGGAC 


CTGCTGTTCG 


TGTTCGGCGT 


CGTCCAGGAT 


GATTCTCCGC 


3802 


AGCAGGGTGC 


GATCATTGCT 


GCCGCTATCC 


CGACATCGCG 


GGCTGGCGTT 


ACGCCCAACG 


3862 


ACGACTGGGC 


CGCCATCGGC 


ATGCGGCAGA 


CCGACAGCGG 


TTCCACGGAC 


TTCCACAACG 


3922 


TCAAGGTCGA 


GCCTGACGAA 


GTGCTGGGCG 


CGCCCAACGC 


CTTCGTTCTC 


GCCTTCATAC 


3982 


AATCCGAGCG 


CGGCAGCCTC 


TTCGCGCCCA 


TAGCGCAATT 


GATCTTCGCC 


AACGTCTATC 


4042 


TGGGGATCGC 


GCACGGCGCA 


CTCGATGCCG 


CCAGGGAGTA 


CACCCGTACC 


CAGGCGAGGC 


4102 


CCTGGACACC 


GGCCGGTATT 


CAACAGGCAA 


CCGAGGATCC 


CTACACCATC 


CGCTCCTACG 


4162 


GTGAGTTCAC 


CATCGCATTG 


CAGGGAGCTG 


ACGCCGCCGC 


CCGTGAAGCG 


GCCCACCTGC 


4222 


TGCAGACGGT 


GTGGGACAAG 


GGCGACGCGC 


TCACCCCCGA 


GGACCGCGGC 


GAACTGATGG 


4282 
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TGAAGGTCTC 


GGGAGTCAAA 


GCGTTGGCCA 


CCAACGCCGC 


CCTCAACATC 


AGCAGCGGCG 


4342 


TCTTCGAGGT 


GATCGGCGCG 


CGCGGAACAC 


ATCCCAGGTA 


CGGTTTCGAC 


CGCTTCTGGC 


4402 


GCAACGTGCG 


CACCCACTCC 


CTGCACGACC 


CGGTGTCCTA 


CAAGATCGCC 


GACGTCGGCA 


4462 


AGCACACCTT 


GAACGGTCAA 


TACCCGATTC 


CCGGCTTCAC 


CTCCTGAGGA 


TCTGAGGCGC 


4522 


TGATCGAGGC 


CGAGGCCACC 


GCGCGGCCGA 


GTCGCGAATC 


GCCCGCCGAT 


ACTCAGCTTC 


4582 


TCCATACGTA 


CGGGTGCACA 


CAAGGAGATA 


TTGTCAAGAC 


CTGTGGATGA 


GGGTGTTTCA 


4642 


GGCGACCTCC 


GTTTCGCTTG 


ATTCGTCGGG 


CTCAGCGGGT 


GAGATGTCGA 


TGGGTCGTTC 


4702 


GAGCAGCTTG 


CCTTTGTGGA 


ACACCGCGCC 


GGCACGGACC 


AGCGCGACGA 


GATGGGGGGC 


4762 


GTTGACCGCC 


GCCAGCGjSGC 


TTGTGCGGCG 


TCGATCAGCT 


TGTAGGCCAT 


GGCAATCCCG 


4822 


CTGCGACGTG 


ACCCAGGGCC 


CTTGGTGACC 


TTGGTTCGCA 


ACCGCACGGT 


CGCAAACGTC 


4882 


GATTCGATCG 


GATTCGTAGT 


GCGCAAGTGG 


ATCCAGTGCT 


CGGCCGGGTA 


CCGGTAGAAC 


4942 


TCCAGGAGCA 


CGTCGGCGTC 


GTCGACGATC 


TTGGCGACCG 


CCTTGGGGTA 


CTTCGCGCCG 


5002 


TAATCTACCT 


CGAAGGCCTT 


GATCGCGACC 


TGGGCCTTGT 


CGATGTCCTC 


GGCGTTGTAG 


5062 


ATTTCCCGCA 


TCGCCGCGGT 


CGCACCTGGA 


TGAGCCGACT 


TGGGCAGCGC 


AGCAAGCACA 


5122 


TTGGCCTGCT 


TGTGAAACCA 


GCAGCGCTGT 


TCACGGGTAT 


CCGGAAACAC 


CTCCCGCAGT 


5182 


GCCTTCCAGA 


ACCCCAGCGC 


CCCATCACCG 


ACGGCCAGCA 


CCGGGGCGGT 


CATCCCGCGG 


5242 


CGTCGGCATG 


AGCGCAGCAG 


ATCAGCCCAC 


GACTCTGTGG 


ACTCCCGGAA 


CCCATCGGTG 


5302 


AGCGCGACGA 


GCTCCTTGCG 


GCCGTCGGCG 


CGGACGCCGA 


TCATCACGAG 


CAAGCACAGC 


5362 


TTCTCCTGCT 


CCAGGCGGAC 


ATTGAGATGG 


ATGCCGTCGA 


CCCATAGGTA 


CACGAAATCG 


5422 


GTGCCCGAGA 


GATCCCGGTC 


GGCGAAGGCC 


TTCGCCTCGT 


CCTGCCATTG 


CGCGGTCAGC 


5482 


CGGGTGATCG 


TCGAGGCCGA 


CAGCCCGGCA 


CCAGTGCCGA 


GG AACTGCTC 


CAA 


5535 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Thr Ser Arg Val Asp Pro Ala Asn Pro Gly Ser Glu Leu Asp Ser 
15 10 15 

Ala lie Arg Asp Thr Leu Thr Tyr Ser Asn Cys Pro Val Pro Asn Ala 
20 25 30 

Leu Leu Thr Ala Ser Glu Ser Gly Phe Leu Asp Ala Ala Gly lie Glu 
35 40 45 
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Leu Asp Val Leu Ser Gly Gin Gin Gly Thr Val His Phe Thr Tyr Asp 
50 55 60 

Gin Pro Ala Tyr Thr Arg Phe Gly Gly Glu lie Pro Pro Leu Leu Ser 
65 70 75 80 

Glu Gly Leu Arq Ala Pro Gly Arg Thr Arg Leu Leu Gly lie Thr Pro 
85 90 95 

Leu Leu Gly Arg Gin Gly Phe Phe Val Arg Asp Asp Ser Pro lie Thr 
100 105 HO 

Ala Ala Ala Asp Leu Ala Gly Arg Arg lie Gly Val Ser Ala Ser Ala 
115 120 125 

lie Arg He Leu Arg Gly Gin Leu Gly Asp Tyr Leu Glu Leu Asp Pro 
130 135 140 

Trp Arg Gin Thr Leu Val Ala Leu Gly Ser Trp Glu Ala Arg Ala Leu 
145 150 155 160 

Leu His Thr Leu Glu His Gly Glu Leu Gly Val Asp Asp Val Glu Leu 
165 170 175 

Val Pro He Ser Ser Pro Gly Val Asp Val Pro Ala Glu Gin Leu Glu 
180 185 190 

Glu Ser Ala Thr Val Lys Gly Ala Asp Leu Phe Pro Asp Val Ala Arg 
195 200 205 

Gly Gin Ala Ala Val Leu Ala Ser Gly Asp Val Asp Ala Leu Tyr Ser 
210 215 220 

Trp Leu Pro Trp Ala Gly Glu Leu Gin Ala Thr Gly Ala Arg Pro Val 
225 230 235 240 

Val Asp Leu Gly Leu Asp Glu Arg Asn Ala Tyr Ala Ser Val Trp Thr 
245 250 255 

Val Ser Ser Gly Leu Val Arg Gin Arg Pro Gly Leu Val Gin Arg Leu 
260 265 270 

Val Asp Ala Ala Val Asp Ala Gly Leu Trp Ala Arg Asp His Ser Asp 
275 280 285 

Ala Val Thr Ser Leu His Ala Ala Asn Leu Gly Val Ser Thr Gly Ala 
290 295 300 

Val Gly Gin Gly Phe Gly Ala Asp Phe Gin Gin Arg Leu Val Pro Arg 
305 * 310 315 320 

Leu Asp His Asp Ala Leu Ala Leu Leu Glu Arg Thr Gin Gin Phe Leu 
325 330 335 

Leu Thr Asn Asn Leu Leu Gin Glu Pro Val Ala Leu Asp Gin Trp Ala 
340 345 350 

Ala Pro Glu Phe Leu Asn Asn Ser Leu Asn Arg His Arg 
355 360 365 
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CIAIMS 

1. A recoiobinant DNA molecule containing a gene which 
encodes a biocatalyst capable of desulfurizing a 
fossil fuel which contains organic sulfur molecules. 

5 2. The recombinant DNA molecule of Claim 1 wherein said 
fossil fuel is petroleum. 

3. A recombinant DNA molecule containing a gene of a 
strain of Rhodococcus rhodochrous bacteria wherein 
said gene encodes a biocatalyst capable of desulfur- 

10 izing a fossil fuel which contains organic sulfur 

molecules. 

4. The recombinant DNA molecule of Claim 3 wherein the 
gene is derived from the IGTS8 strain of Rhodococcus 
rhodochrous bacteria. 

15 5. The recombinant DNA molecule of Claim 3 wherein said 
fossil fuel is petroleum. 

6. Purified DNA derived from a strain of Rhodococcus 
rhodochrous bacteria wherein said DNA encodes a 
biocatalyst capable of desulfurizing a fossil fuel 

20 which contains organic sulfur molecules. 

7. Purified DNA of Claim 6 wherein said DNA is derived 
from the IGTS8 strain of Rhodococcus rhodochrous 
bacteria. 
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8. Purified DNA derived from a strain of Rhodococcus 
rhodochrous bacteria wherein said DNA encodes a 
biocatalyst capable of desulfurizing petroleum which 
contains organic sulfur molecules. 

5 9. Purified DNA of Claim 8 wherein said DNA is derived 
from the I GTS 8 strain of Rhodococcus rhodochrous 
bacteria. 

10. A recombinant DNA vector containing a DNA molecule 
containing a gene which encodes a biocatalyst cap- 

10 able of desulfurizing a fossil fuel which contains 

organic sulfur molecules. 

11. A recombinant DNA vector containing a DNA molecule 
containing the gene of a strain of Rhodococcus rho- 
dochrous bacteria wherein said gene encodes a bio- 

15 catalyst capable of desulfurizing a fossil fuel 

which contains organic sulfur molecules. 

12. The recombinant DNA vector of Claim 11 wherein the 
gene is derived from the IGTS8 strain of Rhodococcus 
rhodochrous bacteria. 



20 13. A DNA plasmid vector containing a DNA derived from a 
microorganism wherein said DNA encodes a biocatalyst 
capable of desulfurizing a fossil fuel which con- 
tains organic sulfur molecules. 
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14 



15. 



16, 

10 



17. 



15 



18. 



20 



19. 



25 



A DNA plasmid vector containing a DNA derived fro* a 
strain of Bfeodococ c ns rhodochr ous bacteria wherexn 
said DNA encodes a biocatalyst capable of desulfur- 
izing a fossil fuel which contains organic sulfur 
molecules. 

The DNA plasmid vector of Claim 14 wherein the DNA 
is derived from the IGTS8 strain of Fhortococcus 
rhr>rinchrous bacteria. 

A DNA plasmid containing a recombinant DNA molecule 
containing the gene of a microorganism wherexn said 
gene encodes a biocatalyst capable of desulfurizing 
a fossil fuel which contains organic sulfur mole- 
cules . 

A DNA plasmid containing a recombinant DNA molecule 
containing the gene of a strain of pyococcus rho- 
^chrous bacteria wherein said gene encodes a bio- 
catalyst capable of desulfurizing a fossil fuel 
which contains organic sulfur molecules. 

The DNA plasmid of Claim 17 wherein said gene is 
derived from the IGTS8 strain of Phnrtocnccus rhodo- 
ehrous bacteria. 

A DNA plasmid containing an isolated DNA derived 
from a strain of pho^ororms rfrpdocftrous bacteria 
wherein said DNA encodes a biocatalyst capable of 
desulfurizing a fossil fuel which contains organic 
sulfur molecules. 
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20. The DNA plasmid of Claim 19 wherein the isolated DNA 
is derived from the IGTS8 strain of Rhodococcus 
rhodochrous bacteria. 

21. Plasmid pTOXI-1. 

5 22. Plasmid pTOXI-2. 

23. A microorganism containing a recombinant DNA plasmid 
containing a gene which expresses a biocatalyst 
capable of desulfurizing a fossil fuel which con- 
tains organic sulfur molecules. 

10 24. A microorganism containing a recombinant DNA plasmid 
containing the gene of a strain of Rhodococcus rho- 
dochrous bacteria wherein said microorganism ex- 
presses a biocatalyst capable of desulfurizing a 
fossil fuel which contains organic sulfur molecules. 

15 25. The microorganism of Claim 24 wherein the gene is 

derived from the I GTS 8 strain of Rhodococcus rhodo- 
chrous bacteria. 

26. A microorganism containing a DNA derived from a 

strain of Rhodococcus rhodochrous bacteria wherein 
20 said transformed microorganism expresses a bio- 

catalyst capable of desulfurizing a fossil fuel 
which contains organic sulfur molecules. 



25 



27. 



The microorganism of Claim 26 wherein the DNA plas- 
mid is derived from the IGTS8 strain of Rhodococcus 
rhodochrous bacteria. 
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Complemented host organism CPE-648 containing plas- 
mid pTOXI-1. 



29. Complemented host organism CPE-648 containing plas- 
mid pTOXI-2. 

5 30. A method of desulfurizing a. fossil fuel which con- 
tains organic sulfur molecules through the use of an 
organism containing a recombinant DNA plasmid con- 
taining a gene which expresses a biocatalyst capable 
of desulfurizing a fossil fuel which contains or- 

10 ganic sulfur molecules, comprising the steps of: 

a) contacting the fossil fuel with the micro- 
organism; and 

b) incubating the fossil fuel and micro- 
organism mixture under conditions suf- 
ficient to bring about the catalytic clea- 
vage of organic carbon-sulfur bonds, 

whereby the organic sulfur content of the fossil 
fuel is significantly reduced. 

A method of desulfurizing a fossil fuel which con- 
20 tains organic sulfur molecules through the use of a 

microorganism containing a recombinant DNA plasmid 
containing the gene of a strain of pyococcus rho- 
dochrous bacteria wherein said microorganism ex- 
presses a biocatalyst capable of desulfurizing a 
25 fossil fuel which contains organic sulfur molecules, 

comprising the steps of: 

a) contacting the fossil fuel with the micro- 
organism; and 
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b) incubating the fossil fuel and micro- 
organism mixture under conditions suf- 
ficient to bring about the catalytic clea- 
vage of organic carbon-sulfur bonds, 
5 whereby the organic sulfur content of the fossil 

fuel is significantly reduced. 

32. The transformed microorganism of Claim 31 wherein 

the gene is derived from the IGTS8 strain of Rhodo- 
coccus rhodochrous bacteria. 

10 33. A method of desulfurizing a fossil fuel which con- 
tains organic sulfur molecules through the use of a 
microorganism containing a DNA plasmid containing 
DNA derived from a strain of Rhodococcus rhodochrous 
bacteria wherein said microorganism expresses a 
15 biocatalyst capable of desulfurizing a fossil fuel 

which contains organic sulfur molecules, comprising 
the steps of: 

a) contacting the fossil fuel with said mi- 
croorganism; and 
20 b) incubating the fossil fuel and transformed 

microorganism mixture under conditions 
sufficient to bring about the catalytic 
cleavage of organic carbon-sulfur bonds, 
whereby the organic sulfur content of the fossil 
25 fuel is significantly reduced. 

34. The method of desulfurizing a fossil fuel of Claim 
33 wherein the fossil fuel is petroleum. 
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The method of desulfurizing a fossil fuel of Claim 
3 3 wherein the DNA is derived from the I GTS 8 strain 
of Rhodococcus rhodochrous bacteria. 

A nucleic acid probe capable of hybridizing to all 
or a portion of a recombinant DNA molecule contain- 
ing the gene of a biocatalyst capable of desulfur- 
izing a fossil fuel which contains organic sulfur 
molecules. 



37. A nucleic acid probe capable of hybridizing to all 
10 or a portion of a recombinant DNA molecule contain- 

ing the gene of a strain of Rhodococcus rhodochrous 
bacteria wherein said gene encodes a biocatalyst 
capable of desulfurizing a fossil fuel which con- 
tains organic sulfur molecules, 

15 38. A nucleic acid probe capable of hybridizing to a all 
or a portion of a DNA derived from a strain of Rho- 
dococcus rhodochrous bacteria wherein said DNA en- 
codes a biocatalyst capable of desulfurizing a fos- 
sil fuel which contains organic sulfur molecules, 

20 39. The nucleic acid sequence set forth in Sequence No. 
1. 

40. The nucleic acid sequence set forth in Sequence No. 
2. 



25 



41. 



The nucleic acid sequence set forth in Sequence No. 
3. 
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10 49. 
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The nucleic acid sequence set forth in Sequence No. 
4. 

The nucleic acid sequence set forth in Sequence No. 
5. 

The DNA sequence for ORF-1. 

The DNA sequence for ORF-2 . 

The DNA sequence for ORF-3 . 

The amino acid sequence for ORF-1. 

The amino acid sequence for ORF-2. 

The amino acid sequence for ORF-3. 

The DNA sequences set forth starting on page 66 
through page 80. 
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AMENDED CLAIMS 
[received by the International Bureau on 4 February 1994 (04.02.94); 
original claims 39-43 amended; remaining claims unchanged (2 pages)] 

35 . The method of desulfurizing a fossil fuel of Claim 3 3 
wherein the DNA is derived from the I GTS 8 strain of 
Rhodococcus rhodoch r ous bacteria. 

36. A nucleic acid probe capable of hybridizing to all or 
5 a portion of a recombinant DNA molecule containing 

the gene of a biocatalyst capable of desulfurizing a 
fossil fuel which contains organic sulfur molecules. 

37. A nucleic acid probe capable of hybridizing to all or 
a portion of a recombinant DNA molecule containing 

10 the gene of a strain of Rhodococcus rhodochrous 

bacteria wherein said gene encodes a biocatalyst 
capable of desulfurizing a fossil fuel which contains 
organic sulfur molecules. 

38. A nucleic acid probe capable of hybridizing to a all 
15 or a portion of a DNA derived from a strain of 

Rhodococcus rhodochrous bacteria wherein said DNA 
encodes a biocatalyst capable of desulfurizing a 
fossil fuel which contains organic sulfur molecules. 

39. The nucleic acid sequence set forth in SEQ ID NO: 1. 
20 40. The nucleic acid sequence set forth in SEQ ID NO: 2. 

41. The nucleic acid sequence set forth in SEQ ID NO: 3. 

42. The nucleic acid sequence set forth in SEQ ID NO: 4. 

43. The nucleic acid sequence set forth in SEQ ID NO: 5. 



44. 



The DNA sequence for ORF-1. 
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45. The DNA sequence for ORF-2. 

46. The DNA sequence for ORF-3. 

47. The amino acid sequence for 0RF-1. 
5 48. The amino acid sequence for ORF-2. 

49, The amino acid sequence* for ORF-3. 

50. The DNA sequences set forth starting on page 66 
through page 80. 
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